PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG61835.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family MYB
Protein Properties Length: 3081aa    MW: 331362 Da    PI: 5.6816
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG61835.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding34.45.3e-11351148
                     TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHT.TTS-HHHHHHHHHHHT CS
  Myb_DNA-binding  1 rgrWTteEdellvdavkqlGggtWktIartmg.kgRtlkqcksrwqkyl 48
                     +g+W ++Ed +lv +++q+G+g W+++ ++   + R++k+c++rw ++l
       GBG61835.1  3 KGPWDPQEDAILVSYINQYGPGKWNLVRAKGFdLARCGKSCRLRWVNHL 51
                     79************************9544336*************996 PP

2Myb_DNA-binding41.33.7e-1358100246
                      SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                       ++T eE++l  +++ ++G+  W++Ia+ m+ gRt++++k++w++
       GBG61835.1  58 TPFTVEEQQLVFELHGRFGNQ-WARIASQMP-GRTDNEIKNFWNT 100
                      68*****************99.*********.***********96 PP

Sequence ? help Back to Top
Protein Sequence    Length: 3081 aa     Download sequence    
MKKGPWDPQE DAILVSYINQ YGPGKWNLVR AKGFDLARCG KSCRLRWVNH LQPNLKKTPF  60
TVEEQQLVFE LHGRFGNQWA RIASQMPGRT DNEIKNFWNT KHRKLKQKKV NGEKGSSDGA  120
TNAGAVGGGP GAVTASTGTN TAESSGAGPG AGTTSTGTNT AESSGAGGGS GGARGGEKGG  180
ERGGERGGER GHHPGKMSNG TGSSSGRSTH GPRSKSSGGG GSSHAAASAA VTSLENGGGG  240
GGSGHHHSHM RGSRSHQHHH GSHGWSSLGS ASSEGGGGGG GGGGGGGGGG GGGGGGCGFP  300
KVPSTNSATA YLITHSGVKG SNSSSNLHAH GKQQQQQLHP HQNAGAGFSS LDITVGTCQH  360
SGGAGCVLGS PSLSTVLDPF DGLHEESTRH LVDDNDATGA AAFETATSSR AGIARVSRGV  420
ETCTDLEQSF SAWEEDNFPG HGQGRLHQGP TMVSAATQAE SHSSSGDKEQ VESMVEVVGK  480
PACKGIAAAA VAAMGTYSGA SLSGKSGLMS HTAGVTTGSG GGRSGGGLRK REASSQGGVG  540
GIANCVAGGH LSGTKAVGSP SKVRKRLSSS AAAAAAAAAA AAAASASASA TPAPPSAALI  600
TATTGMTIMP SANSPRRIII RRNDRQLAAT STQQSQQQQP QQQPQQQQQQ QQQQQQQQQQ  660
QQATSACHLS TTPVSSLSGV QPPAIPADVF GLWTAAGTLG GAGMVVGGGL EDFCAQSCEW  720
ATAGVGPAAH AFDEPTLSVG TETGKLVWVD PGPMSGHKDG CLYPGPEGDG WISLTEAGII  780
STNSFSQLDS SVPTPTMLAL ENRSPGSGGA WGSGGAKFNP LSENEMAKLR RGNIVRNFVT  840
AGQYAGDQSK MHGDRKLRCN LCGHLFQGGS SKAARHFTQA KFCKAGEMRV LAELWNGTNY  900
TFMPSTAQRV QRWMADEGIR DTRAPASGQR QRMDDAERDE IQDALDEEGH EGGAREGGVA  960
DEGDEGVGEP NLQGRRWEKR ARQTTIDEMY AREKLAEFSD AWLQWVYVKK LPFNAFRGPE  1020
FQRVQQAAER VPRSIQFRFP SYRVTAGVGI PSQRAKVATI MSKVRATFRH SGATILSDGR  1080
KSRSGKPLVN FLAGGANGAL LYATVARDGS VRDTADVVYR RWRAIILSVP AKDVIGFCTD  1140
SASNYTAAAR RFATDPEPDI RRITWLPCST HVCNLMLSDI GTRVGWVKET IIHARALVRF  1200
IKSHGAAHTL FRKVQWVQQQ IRDREFWQRV QYAILGMSPV HQLLRRMDRG GMMMSVVYEW  1260
SQHLLQLMRR VDVPTDMIEP CVREVAICNL HMLEPAHAAA HLLNPRRRSL TYYHSLETTA  1320
DDRRVVEECD IFLLAQTGGD PVGRLYRSVR DQMRAFHSRR GDWGDHDLSD AEAVNCRGDS  1380
GTERCAAWWF EHGRAHPELR TIAIRAEHER VSTTRRCKLG FAKLAQLVEI ATNLQLASCA  1440
RQGGGYVLPW VMGTGRGGTV AEEEEEEGDV EPEVWGAQPD GSVLEQEIQQ QIVAFHDSRP  1500
SRARSVRDMF GSKATELRPW PEGGDDVDAA AADDHIDDDW TDDDDTPLSR DPTAEQVYFT  1560
YGGGHDGMDS FTSVITGDVP STAQASGSSR AGGGHGGHSH AEVRVDDSGE QQPRGGLRQT  1620
GRRWEVRSDS EAEGEAEDEE VPLRDRRFSP SHHPISAQPE ALRRLERLAS TAGRDRTHDG  1680
EDRGGERTPS DAGIRPRSPS VLRTQDFDVV GLGGSLGDFS VEGRRHASPQ EVRTDADIER  1740
GPVETEEERD ARLDREEEER LRSLPQWEGR FAYLDEQCPQ RDLETGGGGG RDVDDFGVPE  1800
EAVQGEFDYE GQDAAAGGGS GGGRRGDEET AGVPEGQDVV MGGGSPSGGR GDSETAGDEG  1860
QGATVCGRGE GGPGGDGDTA GVGGDDGPDD DDDDDHGPED DPYRLALVLR DPTVPPLRPE  1920
DTAHTFFDVD ALAQALTDDP FAHVSRKSGK QRAPGKVYSP PMFVVLSPHW SGSSLSGVRG  1980
RESGAGEVGF GGRSDGGRDS VGSMPPPPAQ TPEARVDTGD GTAAPGVAHS DSSPPTVRGG  2040
VGGGWSAVRR VGDRLRADYD AGRGVFTGCT AVQTQAAEGG SGRPTLQMAG RMLGLSRAET  2100
RRSLVLEAAG TSADRLRGEQ FTSGEEGLLM RPGTRRQHGL SEVEARLAAG VAAGQAALDA  2160
IQRERERGIA AQAEEDEDAD TESEPIETAA RTHRAQVAAA AAAAHAAYVS GERGTGAGGH  2220
MAQVPEGEEG AGEDRMAARP PGKAARALVI MSSRKYEASL VKRHFDVLSP AGKDPSKGGK  2280
WWVCHYCDLK FSGNLNRLKE HFTKGICAVE KSTKTLTAEE ILRRKEGLRQ ARTDSGMARD  2340
GANALPVVTY LVAASTGGGL VEGVGDACGE AAGVACGEEN EFFNDMVDAI REAHPSWRLH  2400
SREAARNGRL NGQHQRAIED VGRLAKKWDR TSCMVQVDGW SDRRGRPHVN VMVSSPIGTV  2460
FWKSVCIAGK DKDACAYYRI LSTAIEEIGP RSVVGVIMDN ARVCVKAGEL VEKKYPWIFR  2520
VGCTEHALDL ALEDMDKRIL WFAETVKRGN VLGKFVMNHD KVRALFLTKS GGVEIKRPGV  2580
TRFATNILML QSLWDRANAL KLTLAASGWV SSLVPPHLRS QFEEVTATIL DGGFWERVDK  2640
ALRTTKLVDG PGATIAKVYH RMNGVVESIH KLDILADFEK AEVESILMDR WAFMTSELHC  2700
AAAFLDPEYR SQGSFRDTEI RAGFNIWLYT WCTDEMFDDV SWQVDEWVHL KGALQSEEAL  2760
RAARTKTPAM WLDVFASRLH LLQPQAIRLL GQASSSAACE RNWSLHELIF GRRRTRILPT  2820
RLSKLVYNNW NLHLQWRQER GRGVNDVHIL WVEEMSDAEL KEEAEQFYND WVKRVKNSMP  2880
SGENDTSDEE GELEDDDDGE EVPMARRWLR NDKVDNAMDQ EEGLAHIDKY TDDWHFNTRS  2940
GRQLERALRR LSGHERPTPE VQYNKEDREL ALRARETGDC VEGRKEGGSS RQRKMRRGDE  3000
GRRHASSEEG DDEGHDANDG EPRSVEEGTE HEEEEEEGEE EEEGEEEEEG EEEEEGEEEE  3060
EGEEEEEEEE GETSARGGAE D
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1520527GGGRSGGG
2521528GGGRSGGG
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G26930.13e-39MYB family protein