Enzyme

The enzyme is highly specific for cyanidin 3-O-glucosides and UDP-alpha-D-glucuronate. Involved in the production of glucuronosylated anthocyanins that are the origin of the red coloration of flowers of Bellis perennis [1].

UGAT

Enzyme

Cyanidin-3-O-glucoside 2-O-glucuronosyltransferase

Enzyme name

EC:2.4.1.254

Entry

Transferases;Glycosyltransferases;Hexosyltransferases

Class

UDP-alpha-D-glucuronate:cyanidin-3-O-beta-D-glucoside 2-O-beta-D-glucuronosyltransferase

Sysname

UDP-alpha-D-glucuronate + cyanidin 3-O-beta-D-glucoside = UDP + cyanidin 3-O-(2-O-beta-D-glucuronosyl)-beta-D-glucoside [RN:R09635]

Reaction

UDP [CPD:C00015];cyanidin 3-O-(2-O-beta-D-glucuronosyl)-beta-D-glucoside [CPD:C19762]

Product

Species information

Coffea canephora

Species name

Robusta coffee (Coffea canephora) is a species of eudicot in the family Rubiaceae (madder family).

Species info

Gene sequence info

Coffea_canephora.UGAT.fa

Download
>rna-GSCOC_T00016422001
MQPQISSLLSQLKPHFVLFDFAQEWLPPLASQLGIKTVFYSVFVALSTSYLTIPARLPEAEPTRPPAIEDLRKPPPGFPETSIKSMKTFEARDFLYMFKSFHGGASVYDRVLRGLNGCDIILAKTCREMEDPYVDHVTQQFKKPVLLVGPVVPEPRSEPLEGRWASWLGQFEPKSVIYCSFGSKTFLSDEQVKELLLGLDLTGLPFFVVLNFPANTDISAELKRALPEGFLEKVKHKAVIHAGWVQQQQILAHQSVGCYVFHAGFSSVVEAIVNDCQLVMLPVRGDQLLNAKLVSGDLKAGVEVNRRDEDGYFGKDDIKDAVGRVMADIDKEPAKSISGNHKKWKEFLQNSEIQTKFVSDLVKEMEAVAGLRTV
>rna-GSCOC_T00024028001
MDSKEDVIRILMLPWLAHGHVSPFLELARKLTSRNPNFQVYICSTPVNLSPFRENLAVKFELSSSIQFIDILLPSTSELPPEYHSTKNLPPHLMPALKTAFDGAKDCFLDILKDLKPHLLIYDFLQPWAPAAAQEENNIPSVHFMSCSASIGAFLFHCTEYPDLDYPIPELNFPEITRQELVQFMYNVSNGLTNKERYSQCIEKSTDFFLVKTLSEIEQKHMDYFSEMTKKEVIPVGPLVQEPENRSSDMVFLEWLSKRGPSSVVFVSFGTEYFLSKEEIEVIASGLELSMVSFIWVVRFHGRENVTSLLEVLPEGFQKRVAERGMVVEGWAPQVKILSHPSIGGFASHCGWSSTLESIAFGVPIIAIPMQLDQPLTSRLVAEVGVGIEVRRENGKFREEEIARAIKQVVLQEEGKEVRKKVRELSNKIKEKGDQEIDHVVEKLLQLVKD
>rna-GSCOC_T00016432001
MEFNGGHRLHVVMFPFFAFGHISPFVQLSKKLSLHGVKISFFSALGNVNRIQSMLHSVSAPTTTQVIPLTLPPVEGLPPGLESTADLSPAQSELLKLALDLMQPQISSLLSQLKPHFVLFDFAQEWLPPLASQLGIKTVFYSVFVALSTSYLTIPARLPEAEPTRPPAIEDLRKPPPGFPETSIKSMKTFEARDFLYMFKSFHGGASVYDRVLRGLNGCDIILAKTCREMEDPYVDHVTQQFKKPVLLVGPVVPEPRSEPLEGRWASWLGQFEPKSVIYCSFGSETFLSDEQVKELLLGLDLTGLPFFVVLNFPANTDISAELKRALPEGFLEKVKHKAVIHAGWVQQQQILAHQSVGCYVFHAGFSSVVEAIVNDCQLVMLPVRGDQLLNAKLVSGDLKAGVEVNRRDEDGYFGKDDIKDAVGRVMADIDKEPAKSISGNHKKWKEFLQNSEIQTKFVSDLVKEMEAVAGLRTV
>rna-GSCOC_T00016421001
MQPQISSLLSQLKPHFVLFDFAQDWLPPLASQLGIKTVFYSIFVALSTSYLTVPARLPRSTAYLARPSAIGDLKKPPPGFPETSIKSVKTFEARGFLYMFKSFYSGASVYDRVLRGLNGCDVILAKTCREMEGPYVDYVTQQFKKPVLLVGPVVPEPSSEPLEGRWASWLGQFEPKSVIYCSFGSKTFLSDEQVKELLLGLDLTGLPFFVVLNFPADADISADLKRALPEGFLEKVKHKAVIHAGWAQQEQILAHQSVGCYVFHAGFSSVVEAIVNDCQLVMLPVRGDQLLNAKLVSGDLKAGVEVNRRDVDGHFGKDDIKDAVGRVMADIDKEPAKSIRGNHKKWKEFLQNSEIQTKFVSDLVKEMEAMAGLRTV
>rna-GSCOC_T00034385001
MESSNVLHVVMFPWLAMGHLIPFFRLSKLLAGKGHKISFISTPRNLQRLPKIPQELASQIELVSIPLPEVDNLPKQGETSTDIPHEKDQFLKIAFDLLRSPIASFLENTRPKPDWIIHDYASHWLPEIAAQNGVSAAFFSLFTAAALSFLGRPSALLSGEDGRSTAEDFTVVPKWIPFPSNCAYRLHEVRKNIEDASGNESGASDFVRFAASIDRSDLVIFRTSVEFEPEWFNLVRELYQKPVVSLGVLPPSLDDDDELETDEKWQKIQNWLDKQTASKVVYVALGTEATISQKEVQDLAIGLEQSELPFFWVLRKPPGSKKDVTDMLPEGFRERINANGQGVVYTEWVPQVKILSHPAIGGYLTHCGWNSVIEALGFGRVLILFPVMNDQGLNARLLEGKKVGVEIPRAAEDGLFTSTAVAETLRYAVVSEEGEPMRANARQMTSLFGNGKRNQDYIDTFVRCLEERKISNFLASTSTS
>rna-GSCOC_T00035347001
MFPWLAHGHMSPFLELSKRLAQNNFQVYFCSTEVNLSFIKKDRNLDEYFSDHSIELVQLDLPHFPELPPHYHTTKNLPPHLNPTLHVAFYMGKTNFQSILDTLKPDLLIYDIFQPWASKLASLIHIPYVLFMAVGAVYWSWSYFYDAINKGYSGIDGTYPFPAIVLKDYEIKNSAAILQEFKKNASEEVLLSTTKSFEVSSDIVLLKTWREIEGKYIDHLSSCCGKKIVAVGPLAELKHDDTKKEEEEENSSHLIKFLNDRSESSVVYVSFGSECFLSEEESEEMAYGLELSNADFIWVVRFPVGHAIALEEALPEGFLERVKTRGVVVDGWAPQAKILEHPSTGGFVSHCGWGSFMESIYYGVPLLALPMLYDQPHHARLAVEIGVGIEILRDEDGRIKRENVAKVIKEVVVEKIELGESVKQKAKELSHKLREEGEDELHEAVEKLKSLCSKN
>rna-GSCOC_T00035350001
MEQTSNDESNTRRFRVLMFPWLAHGHMSPFLELSKRLAQNNFQIYFCSTEVNLSFIKKDRNLDEYFSDHSIELVQLDLPHFPELPPHYHTTKNLPPHLNPTLHVAFYMGKTNFQNILNILQPDLLIYDVFQAWASELASLIHIPSVLFLCAGLVCRAWSYFYDVNNKGLSGVDGTYPFPAIVLKDYEIKKLAAFLQEFKKNIPEEVMLSLTKCLEVSSDIVLLKTCREIEGKYIDHLSSCCGKKIVAVGPLIELKHDDTKTEEETENSSHIIEFLNGREESSVVYVSFGSECFLSEEEREEMAYGLELSNANFIWVVRFPVGHAIALEEALPEGFLERVKTRGVVVDGWAPQAKILEHPSTGGFVSHCGWGSFLESIYYGVPLLALPMLYDQPHHARLAVEIGVGIEILRDEDGRIKRENVAKVIKEVVVEKTELGESVKQKAKELSHKLREEGEEQLHEAVEKLKSLCSKIQSQE
>rna-GSCOC_T00034383001
MENSKFLHVVMFPWLAMGHLIPFLRLSKCLAQKGHKISFISTPRNLQRIPKVPHDLASLIEMVSIPLPEGDNLPKQGESSMDVPHEKEQFLKIAFDLLQSPIATFLENTTPKPDWILYDYASHWLPEIAAKNGIARAYFSLFTAATMAYIGPPSILLNEVDGRSTAEAFTRVPKWIPFPLNIAYRLHEVAKHIEDSSGNESGTSDVIRFAAAIEGSDLVVFRTCVEFEPEWFDLVCELYKKPVISLGVLPPSLDEDDELETDETWLSIKDWLDKQSVSRVVYVALGTEVALGQTEVHELALGLEQSELPFFWVLRKSPGSTKDVSQMLPEGFTERINANGRGVIYTEWVPQVKILSHPAIGGYLTHCGWNSVIEALGFGRVLILFPVMNDQGLNARLLEGKKVGVEIPREAEDGSFTSTAVAETLRLAVVSEDGESMRANALKMKNLFGDGNANQSYLDSFVRSLEEKKHILFPGPAVEKGT
>rna-GSCOC_T00035348001
MEQTSNDESNTRRFRVLMFPWLAHGHMSPFLELSKRLAKNNFQIYFCSTEINLSFIKKDRNLDEYFSDHSIELVQLDLPHFPELPPHYHTTRNLPLHLNPTLHVAFYMGRTNFQNILNILQPDLLIYDMFQAWASELASMFHIPAVLFLGGGAVFWSWYYFYDINNKGYSDIDGTYPFPAIFLRDYEIKKMAAFLQELKKNVPEEVVLSMTKGFEVSSDIVLLKACREIQGKYIDHLSSSRGKKVLAVGPLIELKHDGTKMEEETENSSHIIEFLNGREESSVVYVCFGSEYFLSEEEREEMAYGLELSNANFVWVVRFPVGHHCP
>rna-GSCOC_T00035352001
MFPWLAHGHMSPFLELSKRLAKNNFQIYFCSTEINLSFIKKDRNLDEYFSDHSIELVQLDLPRFPELPPHYHTTKNLPPHLNPTLHVAFYMGRTNFQNILNILQPDLLIYDMFQAWASELASMFHIPAVLFLGGGAVFWSWSYFYDIINKGYSGIDGTYPFPAIFLRDYEIKRMAAFLQESKEKVPQEVVLSKTKGFEVSSDIVLLKACREIDGKYIDHLSSSRGKKILAVGPLIELKHDDTKEEEKDENSSHIIEFLNGREESSVVYVCFGSEYFLSEEEREEMAYGLELSNANFVWVVRFPVGHAIALEEALPEGFLERVKDRGVVVDGWAPQAKILEHPSTGGFVSHCGWSSFMESLYYGVPLLALPMLYDQPLQARLAVEIGVGIEILRDEDGRIKRENVAKVIKEVVVEKIELGESVKQKAKELSHKLREEGEGQLHEAVEKLKSLCSKNQSQEQ
>gene-GSCOC_T00038278001
MEENQTISSILMLPWLAHGHVSPYLELAKKLTARNFNIYLCSTPINLSSIRSKISPKFGKSIQLIELNLPTLPNLPPQYHTTNGLPPHLMVTLKEAFEMASPNFCKILRTLMPDLLIYDLLQPWAPEAASYYNIPAVEFITCSATMTSCMLHFFQTPGIQFPYSSTIFFRDYDFKEIEGKYNDYVSCLSGKKVVPVGPLVQDPVHDNEDSTIMEWLNTKGKYSTVFVSFGSEYFLSKEDLEGVAHGLELSSLNFVWVVRFPKGENIIVEEALPKGFLKRVGERGKIVNGWAPQAKILNHSSIGGFVSHCGWNSVLEGMRFGVPIIAVPMHLDQPVNARLLEEVGAGMEVVRGSTGKIHGEDMAEIINQVVKGPRGEPVRKKARDLREKLELKGDEEIDEVVKELVQLCLTKDESNGLHQPIQ
>gene-GSCOC_T00020608001
MEGTLRESNAPRFRILMFPWLAHGHISPFMELSKRLAKSNFQVYLCSTEINLNFMKRSAKFDENSSDHAIEFVQLDLPDLPELPPHYHTTKNIPPHLEPTLKRAFHMAKTNFQNILNNLKPDLLVYDGFQPWASELAALNHIPSVLFLVVGAVNLSFKILAKLQESKAKEGDALDVFKSIELSSDIVLVKSWRGIEGKYIDHLSSSCGKKLVAVGPLSNQEDNGKEADSYSLIIEFLNSKDEASVVYVSFGSEYFLSKEEREEIAVGLELSNANFIWVVRFPVGHAIGLEEALPEGFLERVKGRGMVVNGWAPQAKILGHRSTGGFVSHCGWGSVIESIYYGVPLLALPMHLDQPLHARLAVEIGVGIEILKDEDGQIKREEIARVINEVVVKKTEGKLQGQKAIELSKKLREEGEEELHEAIEKLRSLCSKNK
>rna-GSCOC_T00019614001
MAGKSGSLHVVMFPWLAFGHFIPFLDLSKFIAQRGHKVSFISTPKNVDRLPKIPLEFASSITYVKIPLPRVDGLPENVEATVDLGGLDVAVLKKAYDGLEPELTRFLGYSAPDWIIYDFAPYWLPPIAAKLNISKSFFSIYSAFSMAFLAPPFEALIAGADPRTKVEDFTVPPKWIPFESKVAFKLHESRWMLESRNLDGSGVSDLYRVGSVVKGADVILMRHCHEFEGQWLKLVEDLQARPLIPVGLMPPPVEKSSLENNESWIAIKDWLDGQGKGSVVYVALGSEVSLNQLQLSELALGLESSGVPFFWALRNPSGLPGGFEDRVKGRGIVWKNWAPQLNILSHDSVGGFLTHCGWGSSIEGLMFGHPLIMLPFVIDTGLIARVLEEKLVGIEIPRNDVDGSYTSDSVANSVRLVMVENEGKVFKDKAKEIGAIFGDQDLHNSYLQKSVDYLEKNRNESK
>rna-GSCOC_T00020609001
MGGFVSHCGWCSVIETIHFGVPLLALPMHLDQPLHARHAVEIGIGIEIPKDEDGQIKRQEIARVINEVVVKKKKGQLQRQKAIELSKKLREEGEEELHEAMEKLRTLCSKNK
>rna-GSCOC_T00032060001
MENHATFNVLMLPWLAHGHVSPYLELAMKLTARNFNVYLCSSPATLSSVRSKLTEKFSQSIHLVELHLPKLPELPAEYHTTNGLPPHLMPTLKDAFDMAKPNFCNVLKSLKPDLLIYDLLQPWAPEAASAFNIPAVVFISSSATMTSFGLHFFKNPGTKYPYGNTIFYRDYESVFVENLKKRDRDTYRVVNCMERSSKIILIKGFKEIEGKYFDYFSCLTGKKVVPVGPLVQDPVLDDEDCRIMQWLNKKEKGSTVFVSFGSEYFLSKEDMEEIAHGLELSNVDFIWVVRFPKGENIVIEETLPKGFFERVGERGLVVNGWAPQAKILTHPNVGGFVSHCGWNSVMESMKFGLPIVAMPMHLDQPINARLIEEVGAGVEVLRDSKGKLHRERMAETINKVTKEASGEPARKKARELQEKLELKGDEEIDDVVKELVQLCATKNKRNGLHCYN
>rna-GSCOC_T00029577001
MVLKGWAPQGRILKHPSIGGFVSHCGWSSVMEGIKFGVPIVAVPMHLDQPLNARLVEELGIGEEVVRNKQGILEKEQVSSVIRKVVDEKSTTGERFRRKVRELSEKMREKEEEEIDDVVEEMVKPCRKVDRYNSEDMLF
>rna-GSCOC_T00013821001
MGHLTAFLHMSNKLADGGHKIFFLLPRKTQAQLKQFNLYPDLIDFIPLCVPQVEGLPPGAETTADIPFHLQPNLRLAMDSTQPQIESILQEIKPQVVFFDFTHWLPKLARRLGIKSIFFITMSSATSGYTFRGEQTTDADLMKAPPGFPSSCIKLLSHEARGLNFAGKVKEIGSGLSFLERLLISAEDCDAIGFKTCREMEGPYCEFIERKFKKPVILAGPVLPEQPTTTLEEKWEKWLSGFKAKSVIFCAFGSECRLQKDQFQELLKGLELTGLPFLAALKPPIGSETIEMALPEGFNERIQGRSVVYGGWVQQQLILAHSSVGCFVTHCGSGSLSEGLVNECQMVLLPQFGDQFINARLMGGDLRVGVEVEKGDEDGLFTKEGVCKAIRMVMDDGSEIGKEVRANHAKWRDFLLREGTESSYIDEFINKMKRLLS
>rna-GSCOC_T00029576001
MDTRKRSFRILMLPWLAHGHISPFLELAKALTKRNFYIYVCSTPVNLSSLKQNLSEKDSISIKLVELQVPTLPELPPHHHTTNGLPPNLMPTLKEAVDMAKPSFHNILRTLKPDMVLYDFLLPWVPALASAQNIPAVSFIATGAATFSYVVHYKLNPDSEYPFSSIYYREYENHKLTGMADATASGIKDSDRVINCVEQSSGMILIKTCREIEGKYIDYLSRLSKKKIVPVGPSFRRQQWKTRPRKSSNGLARRIVVPRYLLPLAVNISCPEKIWSRLLLDWSSAMLISYGLLGSL
>rna-GSCOC_T00029573001
MLPWLAHGHMSPFLELAKRLAKKNFHIYLCSTSVNLSSIKNQIAGKYSDSIEPVELQLPCLPDLPPHYHTTNGLPPHLMTTLKTAYEMSAPNFSNILTTLHPDLVMYDFNQPWAAEIASSQNIPAVQFLPFGASMTAFGLYMIKYPGKELPYPEIYIRDYEIAKVRSRDARVNDVSDGQRFLQGLDLSCKILLVKSFKEIEERFMDFLSVASGKKVVPVGPLVQDISLDDIQDEEMEIINWLDQKENASVVFVSFGSEYFLTEDERSEIARGLELSNVNFIWVIRFPFGGKITVEKALPEGFLERVGDRGKIVDGWAPQARILKHANTGAFLSHCGWSSMMESMKFGVPMIALPMNIDQPLNARLIEAVGVGLEPLRDEKGNLQSEEIAKVIRKVLVEESGKNVRRKAKELSEQMEMRGDEEEIDNLVEEVVQLCQKNNGCC
>rna-GSCOC_T00029575001
MEYHQDSFSVLMFPWLAHGHISPFLELAKKLSQRNFKVYLCSTPACLVSIKPKLAENFSASIQLVELHLPTLPGLPPEYHTTNGLPSHLMATLKQAFDMASPNFIKILETIEPDLLVYDMLQPWAPTAASALNIPAVEFISSSTTMTSFMLHVLKNNPGTKFPFSNIFHGDLEAILANKLHDDVKFRSKEINRVVQSLQLSSKIILIKSFKEIEGKYIDYLSLLSGKKVVPVGPLVQDPSSTHGNSDDNLEIMEWLDKKEKKSTVFVCFGTEYFLSQEDREEIAHGLELSNVNFIWAIRYPKGENLQLEEALPKGFLARVGERGMVVDGWVPQAKILGHSSVGGFVSHCGWNSVMESMKSGVPIVAIPMHLDQPVNARLIEEVGAGVEVLREDDGTLGREKVAAVIKQVMHEEIGQLVRERARSLSNKIEVKGDEEIDVVVDELVQLCLEKKMKDVKNF
>gene-GSCOC_T00029574001
MGTEPVAVTSQLKVLMFPWLAYGHISPSLELAKRLTDRGFSIYICSTPINLGFIKQKIAGKYSATIKLVELHLPDTPELPSHYHTTNGLPPHLMSTLKRALNRAKPELSSILKTLKPDLVIYDVTQTWTGALTAAHNIPAVKFLTSTVSMMAYFSHSFMKPGLEFPFPAIYLDPAAERPNRECDRIILTKSSRAIEGKYIDYLFDLTKLKMLPVGTLLEEPIKDDQGDNNDELIQWLGTKSERSTVFVSFGTEYFLTREEMEEIAYGLELSDVNFIWVVRFPLGHKTRPEEALPEGFLERVGDRGRIAEGWAPQAKVLAHPGTAGFVCHCGWNSVVESIEFGVPIIAMPVQLDQPLNARLVVEIGAGIEVVRDENGKFDRKEIARVIKDVVAEEMGENVRGKMRDVSQKIKLKEKQELDEVAELLTQLVSCEGK
>rna-GSCOC_T00014708001
MAAQQHHLMMFPWLAFGHLLPFLEFSKKLATKGVKISFVSTPKNLCRLPPIPADLSDRIKLLAVPLPLVDGLPENCEATIDVQPEQTQFLKKAYDRLAEPMEKLLQQESPDLILVDFAAGWIPETAAKFGISVAFFSAYTAATLAFLGPPGELISGTLRKTAEHFTRPPDWFTFPSLVAHRPHYAPTAFKNLHIPDLSGLSSGQRIAKVVRGCSFVAVKSCKEFEGEYINLVEELYQRPVLPIGVLPPPPETIQESHSDNDSSWSTTFQWLDKQKPKSVVFVGFGSEYKMPIEQIHELAFAVELSGLAFIWILRKPLADTVNLLPPGFLDRTSNQGIVCLGWVPQIEILAHPAIGGCLFHSGWGSIIECLGFGHPLILMPMVYDQTLNAKLLAEKEVGYEVPRDIDGSFSRECVAASIRRVMVEAEGEQIRVKAAQMKNVFGNQDLHDNYINKFIQHLERFKNQV
>rna-GSCOC_T00025868001
MDGITFGVPIIALPMQLDQPLNARLAVEIGVGVEKGCGEDIRKKARELSERIQSNVEEEISAVVEQRSAEPRTRLQIEAY
>rna-GSCOC_T00021289001
MFPWFATGHMTPFLHLSNKLAEKGHRISFLLPNKAKHQLEHLNLHPSLITFYTLTVPHVEGLPPGTETASDVPIFLTSLLATAMDNMRDRVRDLLQKLKPSIVFYDMAHWIPELASEIGFKTVNYNVVSAASIAIALVPSRKPVEDRTITGAELMEPPPGYPSSTVLLRRHEAQGLSFIFLEFGKDITFYDRITIAMKRSHAISIRTCRELEGSLCDYIAREYHKPVFLTGPVLPESEKEDLQEKWANWLKGFEPGTVVFCAFGSQVVLEKQQFQELVLGFELTGLPFLIALKPPFGTTSVEEALPEGFEGRIRGRGIVYGGWVQQPAILSHPSVGCFVNHCGFGSMWESLMSDCQIVLVPHLADQILNTRLLAEELKVAVEVERDNKSTWFSRESLCRAIKSAMDRDSEVGGLIRENHAKWKEVLASPTFMGDCIEKFIQDLQEL
>rna-GSCOC_T00021288001
MPRPEGKRLHVAMFPGLATGHMTPFLHLANELAKRGHKVSYLLTKKAKIQLECGNLYPDVVTFHVLPVPHVEGLPPGTENASEIPIFLNSLFALAFDNMSDQVEAALSDLNPDVVLYDTAFRITDFAPKIGFKTVCYNVVSAASIALALVPARQMPKDRPLTEEELMEPPPGYPSSSVVLRKHEAKVLAFMSSEFGARTFYDRIITALKGCHAIAIRSCQELEGQFCDYIGGQYQKPVFLSGPVLPEQEKQPLDGKWAEWLGKFEQKSVVFCAFGSQIILEKQQFQELVLGFELTGLPFFVALKPPLGTGSIEESLPDGFEERVGGRGVVYGGWVQQPQILSHPSVGCFVNHCGFGSMWESLMSDCQIVLVPHLGDQILNSRLLCGDLKVAVEVERDESGWFSKESLSSAINAVMGPDSEVGSSLRKSHLKVKEILSSPGYMSNYVESFIQNLYEL
>rna-GSCOC_T00033510001
MGTEAVISQLKILMFPWLAYCHIWPTLELAKRLADRGFAMYICSTSSILDSSRKSSADYTL
>gene-GSCOC_T00011692001
MKESNTFRILMFPWLAHGHISPFIELSKRLAKNKFKIYFCSTEINLNFIKESKGFDENSSDHSIQLSIAKLVRLDLPDFPELPPHYHTIKNLPPHLTSTLKLAFRMSKTSFSNILNTLKPDLQIYDVLQSWAAELAALNSIPSPLIIGAVNISFFYHGTNCRVSGTNETYPFSEIFFRDYEMKKIIATYQELTKLESEEAEVFKCFELSSDIVLVKSWTEIEGRYIDHLSLCSGKKVVSVGPLNNQDDDTKEEEEQEDNSDSIKFLNSKDESSVVYVSFGSEYFLSKEEREEIAYGLELSNANFIWVVIFPMGHAVALEEALPEGFLHRVKERGIVVDGWAPQAKILQHPNTGGFLSHCGWGSVMESIYDGVPLLALPMQHDQPLNARLVVDVGFGIEILRDEDGQINREEVVAKVINMVVVEKTKAGELLTQKAREMSNNLREEGEEEWNEAVGKVRNLCRKNV
>gene-GSCOC_T00018811001
MLPILAHGHISPFMELTKKLIDRSIHISIHIYLCSTLINLKPISKKLISIKYTESIELVKFHLPELPELPSHYHTTNELLAHLLPILFYSLKLSNPEIHNIVESLKPDFVI
>gene-GSCOC_T00036675001
MADNRKLHVAVFPWLAFGHLIPFFEVAKFIAQKGHKISFLSTPRNMNRLPTLPPGLVPCFDFVNLTLPRIENLPENAEATMDVPPEDIHFLKKAFDGLEDELTQFLESSTPDWIIYDFAPYWLPPIAARLRISRAHFFVINAWFLDFFGPTSWKRTLETSNKPGGSVTDAYRLGSAISGCDMIIIRHCFEFEPLWLNLLEELHHKPVIPLGLMPPALGSEVGLSQDELTELALGLELSGLSFFWALRRSDSLELPNGFLERVKDRGIVWETWAPQSKILSHDSVGSFLTHCGWSSIIEGLEFGRALIMLPIAVDQGLNARILVDSKVGVEIPRDENDGSFTRNSVAESVKKVTVCEDGQIFRDKAKELSYVFGDKDMHSRYMNSFIEYLENHRPLEIGLILQCSRSQHLLNPKYWAMANEPKLHVVMFPWSAFGHIIPFLELAKFKAQRGHGITFISPPRIIDRLPEIPPIFASSITFVKIPLPRVGLPENAEATMDIRNEDIPHLKKAYDGLEPELTRFLESSLPDWIIFDFAPYWLPTIAAKRGISKAFFSFINSWFLAFLGPSDVMINDADPRSTVEDFIVPPKWVPFETKVAYEPYEINWILGAGQENLTGVSDSSRSGMLMKGSDVIALRQSYDQHPLSAFPSKHSLTIVMLGKETNSLCNMISSCENSCFIQHILKTQKWQTIIVFGLLIYKDIMQYSAMANEPKLHVVMFPWSAFGHIIPFLELAKFIAQRGHEITFISTPRNIDRLPEIPPIFASSITFVKIPLPRVGLPENAEATMDIRNEDIPHLKKAYDGLEPELTRFLESSLPDWIIFDFAPYWLPIAAKLGISKAFFSIINSWFLAFLGPSDVMINDADPRSTNLTGVSDSSRSGMLMKGSDVIALRQSYEFEGQWLKLLEELHQRKVIPLGLMPPQVEKISAEARPGPGHVAAQTWPSWASASGPGQLSWAVSSLGLQKGQGTIPQRSEEIPFGAG
>gene-GSCOC_T00036673001
MGFGARVYQSWVLLQGTQPRSTLTCRSDAQPAQFGSSLRQSHQFPIKEWLNGQNRASVLYVALGSEVPPSQTDISELALELSGVPFFWVLRKPPDFSESESVQLPDKFEERVQVRGMVWKGWVPRLKILSHESIGGFLTHCGWGSTIEGLAFGHPLIMLPFLLDQGLNARTIRSIMVEEEGKIIRDKAKEMSGVAGNKELHDACINKFLELLEDTQHKPKN
>rna-GSCOC_T00036679001
MAAEAKKLHVVVFPWLAFGHMIPFLELAKSIAQKGHRVTYVSTPRNVDRLPKIPSTLISQLNYLKLPLPQIANLPENAEATTDLPITKVHCLKKAYDGLQIEVAQFLETTLPDWLIYDFASHSLPSMVGKLGISLAFFSSMNAWSAAFFGSAKTFHTRTQPEDFLVPPKWVPFPSKVAFRRHELMRMDAGNVENASGVSDWDRVAEALIGCDAILVRSCRELESDWLDLTEKMHEKPVITVGLMPPSARDREDDGDDTWHTISGWLNRHGKESVIYVALGTEVAPSQEELTELATGLELSGLPFFWALRKKEGLSEFESLELPDGFEERVKERGIVWTSWTPQLRVLAHDSVGGFLTHCGWSSVIEGLQNGRPLVMLPFQLDQGLNARALEEKMVGIEVPRDEDDGSLSRDSVAESLRLVMADQKGQVYRDKAKEMKLIFGDKDLQEKYEDNLIEYLEKIRGLI
>rna-GSCOC_T00036680001
MAEDGKLHIVMFPWLAFGHMIPYLELSKLIAKKGHKVSFLSTPRNIDRLPKPPPNLTRHLKFVKIPLPHIENLPENAEATTDLPYNKVKYLKLACDGLQQPISEFLRQTCPDWVLFDFAPYWIPSVASKLNIRTAFFSIFTAPFLGFCGPVEVMKGNGEDRKTPQDFTVKPKWVPFETNVAFKLFEILRLVDSLIGDEEPISDIFRGGSSIENCDFLAVRSCSEFEPEWLQLMEEIYQKPVIPVGQLPTTGNNDADGGKDEAWRPIKEWLDKQERGSVVYVAFGSEAKPSQAEVNEISLGLELSGLPFFWVLRTKRGGEDTEVIELPEGFEDRTKDRGVVCASWAPQLKILSHDSVGGFLTHSGWSSVVEAIQFEKALILLTMLADQGINARVLGEKKMGYPIPRDDSDGSFTRDSVAESLRLVMIEDEGKIYRDKSKEMRRLFCDESRQDGYIENLLNFLQSYKSAKEEK
>rna-GSCOC_T00018815001
MAMPMAFEHPITGRVLVENGVAIEVVRDENGRHQREEIAKVIKEVVFGGAGETMRQKIKDSRKKIKSEEKENLDGLLTLIIQLSKKNSSHDINIARA
>gene-GSCOC_T00036674001
MAELGLGPEPRATVLGRIVTRAPEGSGDHPTEIGGDPPRRGIRAILGKSPNVRRSDSRTGINGNVHHLHKVRSLLDKYSRLFYFPFGTGSSISAPSSLVQLGPGKLSASSISNDGNDSWDSIREWLDVRNKGSVLYVALGSEVSLSQTDVTELALGLELSRVPFFCALRKPSGSTESIQVPDGLEERVKGRGIVWKGWAPQLNILSHDSLGGFLTHCGWSSRIEGHVFGHPLVMLPFLVDQGLNARVMEDRKVGTEIPRNEHTRDSVAESGRLIMVENEGKIFREKAKEMSGIFGDRELHDGYIQKFIDYLENNRHNPMAGFCH
>rna-GSCOC_T00036676001
MDVRTEDVEHLKKAYDGLQPELTRFLEDSVPDMIIYDFAPYWLPEAAAKLGISRVYFCIFNAWFFAFFGPTDMMVDGSDPRKKAEDFLVPPKWVPFKDKVAYKPFEVNWMLSSAERNASGVSDIHRAGKVVAGSDAILIRHCHEFEGAWLNLLEELHQKPFIPLGLMPPPTHVNGSDEKNETWDFISSWLEVQERGSVIYVALGSEVTLNQSHVTQLALGLESSGLPFFWASRKPAGSNEPFELPDGYEERVKGRGLVWKGWAPQMRILSHESVGGFLTHCGWSSCVEGILLGLPLVMLPFLVDQGLNARVLEDNGVGIEVPRDEETGLYTSDSVAESVRLIMVENDGKRRREKAKELSLIFGDRELHISYLENSIV
>rna-GSCOC_T00018814001
MLRMLGHGHISPFLQLAKKLTERGIHIYLCSTPINLNSISKKITGKYSESIQLVEFHLQELPELPSRYHTTNGLPSHLLPIFFNFLTVQS
>rna-GSCOC_T00036677001
MANKSGSLHVVMFPWLAFGHFIPFLELSKFIAQRGHKVSFISTPKNIDRLPRIPPEFASSITFVKIPLPPVDGLPENVEATVDLGGLDVAVLKKAYDGLEPELTRFLEYSAPDWIIYDFAPYWIPPIAAKLNISKSFFCIFSAASMAFFVRSVDAMIAGTDPRTKVEDFTVPPKWIPFESKLAFKLYESRWVVQGQNLEGSGVSDSYRVGSAIKDADVTLIRYCPEFEGQWLKLLEDLLKRHVIPLGLMPPPVEKSIVENNESWIAIKDWLDGQGKGSVVYVALGSEVSLNQLQLSELALGLELSGVPFFWALRNPSGLPEGFEDRVKGRGIVWKNWAPQLNVLSHDSVGGFLTHCGWSSSIEGLMFGHPLIMLPFVADTGLIARMLEEKQVGIEIPRNDVDGSYTSHSVANSVRLIMVENEGKIFKDKAKEISAIFGDQDLHDSYLHKCVDYLENKRHESK
>rna-GSCOC_T00018810001
MYFLSEEEIEEIAFGLELSHENFIWALRSPPGEERKLEQILPEGFLERVQDRGRIVQGGVRQAMILGHPSLGGFLSHCGWNSLSKGIEFGVPIVAMPMAFEQPINARVLVENSVAIEITRDENGRLKREEIVKVINNVVTGCAGEPLRQKMKDLRKQIKSSEKENLDGF
>gene-GSCOC_T00042506001
MFPWLAYGCISLNLELAKKLTGRGFSVYICSTPINLGFIQKKITPNYSASIQLVELHRPDTPELPSYYHTTSGLPPHLLSTLQRALNKIDGKYMDYLSDIMKLKIMPVGTLFPEPVDDDQQDKNTKLIQWLSTKTKHSTVFIAFGSEYFWTKEELEEMAFALELSSVNFIWDVRFPLGQRIRPEEGLPQGFLERTRDGGRIVEEWAPQAKILGHPSIGGFITHSGWNSILESIELGVPIINMPMHFDQPFDARSMVDIGAGVEAVRNDNGKFDRKVTAEVIKNVVVEKMGENLRGKMKEVSEKIKLKENQVFDELVDLLTQLVKENSHPSN
>rna-GSCOC_T00013523001
MAKTNFQNILNTLKPDLLVYDGFQPWASELAALNHIPSVLFLVVGTVNLSSVYHSRRCRVSGANETYPFPAIFYRDYEIKKILAKLQESKAKEGDEFDVFKSIELSSDIVLVKSWREMEGKYIDHLSSSCGKKLIAVGPFSNHEDDSKEADSYSHIIEFLNSKDEASLVYVSFGSEYFLSKEEREEIAIGLELSNANFIWVVRFPVGHAIGLEEALPERFLERVKGRGTVVNGWAPQAKILGHRSTGGFVSHCGWGSVIESIYYGVPLLALPMHLDQPLHARLAVEIGVGIEIFKDADGQIKREDFARVINEVVVKKKEGQLQRQKAMELSKKLREEGEEELHEAIEKLRSLCSKNK
>rna-GSCOC_T00008901001
MASDENIHVVLLPWLAFGHIMPSFQLSIALARAGVHASLVSTTKNIQRLPKLPPDLEGSIDLVGLPLPAIDRNLLPEGAEATIDIPFHKIQYLKIAYDLLKNPFKQFIADQAKSPDWIVADLLTHWAGEVGQQLNIPIICFYPFSAATAVFFGPPEYLAGEGQKRVRSTPESLMRKPEWVDFPSTVAYRKREAIGVHAGFYHENASGIATGQRIAKVIQACKAVAIRSCPEFERDYFYLQEKITGKPAIPLGFLPPETSNNLRDESWNNIFQWLDEQKPIKSVVFVGFGSECKLRKDQIHEIAYGLEVSGLPFIWVLQKPSWGDSDNEDDILPLGFGSRVRGKGITQIGWAPQREILAHPAVGGCLFHAGWGSVTETLQYGHSLVVLPFIVDQGLNSRYLVEKGLAIEVERSEDGSFCKDDIAQSLRRAMVPNVEDEGEALRLLRARAAEAAALFGDRELNGCYIERFVEYLKTET
>rna-GSCOC_T00007453001
MANEPKLHVVMFPWSAFGHIIPFLELAKFIAQRGHEITFIFTPRNIDRLPEIPPIFASSITFVKIPLPRVEGLPENAEATMDIGNEDIPHLKKAYDGLEPELTRFLESSLPDWIIFDFAPYWLPTIAAKRGISKAFFSFINSWFLAFLGPSDVMINDADPRSTVEDFIVPPKWVPFETKVAYEPYEINWILGAGQENVTGVSDSSRSGMLMKGSDVIAVRQSYEFEGQWLKLLEELHQRKVIPLGLMPPQAEKISNDGNDSWDSIREWLGVRNKGSVLYVALGSEVPLSQTDVTELALGLELSGVPFFWGATEAVWINRVNSGARWA
>rna-GSCOC_T00006032001
MADKSSSLHVVMLPWLAFGHFIPFLELSKFIARRGHKVSFVSTPKNIDRLPKIPPNLASSISFVKIPLPPVDGLPENVEATVDLGGLDVAVLKKAYDGLEPELTRFLESSVPDWIIYDFAPYWIPHIAAKLNISKSFFCIFSAASLVFSAPSLDAMVAGTDPRTKLEDFTVPPKWIPFKSKLAFKLHESRFVVRSKNLDGSGVSDMYRVGSAIKGADVTLIRYCPEFEGQWLKLLEDLLQRHIIPLGLMPPPMEKSIVENNESWIAIKDWLDGQGKGSVVFVALGSEVSLNQLQLSELALGLELSGVPFFWALRNPSGLPEGFDDRVKGTGIVWKNWAPQLNILSHDSVGGFLTHCGWSSSIEGLMFGHPLIMLPFIIDTGLIARIMEEKQVGIEIPRNDVDGSYTSHSVANSVRLIMVENEGKIFKDKAKEISAIFGDQDLHDSYLHKCVDYLENKRHESK
>rna-GSCOC_T00006031001
MANKSGSLHVVMFPWLAFGHFIPFLELSKFIAQKGHKVSFISTPKNIDRLPKLPPNLASSITFIKIPLPPVDGLPENVEATVDLGGLDVAFLKKAYDGFEPDLTRFLEDSAPDWIIYDFAPYWLPPIAAKLNIARSFFSIVSASTVVFFGPSFDAMINGTDPRSKVEDYIVPPKWIPFESKVAYKLYESKWIVGASNLDGSGASDMYRVGSVIKGADVVLVRHCREFEGQWLNLLENLQRRPVMPLGLMPPKVEKSGLESNESWIAIKKWLDGQGKGSVVYVAFGSEVSMSQLELGELALGLELSGVPFFWALRNPSGLPEGFEDRVEGRGIVWKNWAPQLNILSHDSVGGFLTHCGWSSSIEGLMLGHPLIMLPFVIDTGLVARILEEKQVGIEIPRNPVDGSYTRDSVAKSVRLIMEEDEGKSYRDKAKEISAVFGDRELHDGYLQRFVDYLEKNVRKST
>rna-GSCOC_T00011251001
MLPWLAFGHFIPFLELSKFMAQKGHKVSFVSTPKNIDRLPRIPPNLASSITFVKIPLPRVDGLPENVEATVDLGDLDVAVLKKAYDGLEPELTRFLEYSAPDWIIYDFAPYWIPPIAAKLNISKSFFCIFSAASMAFFVPSVDAMIAGTDPRTKVEDFTVPPKWIPFESKLAFKLYESRWVVQGQNLDGSGVSDSYRVGSAIKGADVTLIRYCSEFEGKWLKLLEDLLQRHIIPLGLMPPPVEKSIVENNESWIAIKDWLDGQGKGSVVYVALGSEVSLNQLQLSELALGLELSGVPFFWALRNPSGLPEGFEDRVKGTGIVWKNWAPQLNVLSHDSVGGFLTHCGWSSSIEGLMFGHPLIMLPFVADTGLIARVLEEKQVGIEIPRNDVDGSYTSHSVANSVRLIMVENEGKIFKDKAKEISAIFGDQDLHDSYLHKCVDYLENKRHESK

Gene tree

Gene tree

Sheng jun & Dong yang Teams, Yunnan agriculture University