PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG74571.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family MYB
Protein Properties Length: 1544aa    MW: 161251 Da    PI: 6.8853
Description MYB family protein
Gene Model
Gene Model ID  Type    Source
GBG74571.1     genome  NCBI
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  30.8   6.6e-10  2      40   9          48
                     HHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
  Myb_DNA-binding  9 dellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48
                     de l +av+ + g++Wk+Ia+ +   Rt+ qc +rwqk+l
       GBG74571.1  2 DETLRRAVQCYKGKNWKRIAEFFA-DRTDVQCLHRWQKVL 40
                     799*********************.************986 PP

2    Myb_DNA-binding  62.8   6.7e-20  46     92   1          48
                     TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
  Myb_DNA-binding  1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48
                     +g+WT+eEde+++++v+q+G + W+ Ia+ ++ gR +kqc++rw+++l
       GBG74571.1 46 KGPWTKEEDERIIQLVNQYGAKKWSVIAQNLP-GRIGKQCRERWHNHL 92
                     79******************************.*************97 PP

3    Myb_DNA-binding  58.5   1.5e-18  98     142  1          47
                      TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
  Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                      r++WT+eEd+ l++a+ ++G++ W+ Ia++++ gRt++++k++w++ 
       GBG74571.1  98 RDAWTEEEDLALIRAHFLYGNK-WAEIAKCLP-GRTDNSIKNHWNST 142
                      789*******************.*********.***********976 PP
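
The three hits above can be compared programmatically. The following is a minimal sketch, assuming the values are transcribed by hand from the table; the `DomainHit` class and `linker_lengths` helper are hypothetical names introduced for illustration, not part of PlantTFDB.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One row of the signature-domain table (illustrative container)."""
    number: int
    domain: str
    score: float
    evalue: float
    start: int      # 1-based, inclusive, on the protein
    end: int        # 1-based, inclusive, on the protein
    hmm_start: int
    hmm_end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates, so add 1.
        return self.end - self.start + 1


# Values copied from the table in this record.
HITS = [
    DomainHit(1, "Myb_DNA-binding", 30.8, 6.6e-10, 2, 40, 9, 48),
    DomainHit(2, "Myb_DNA-binding", 62.8, 6.7e-20, 46, 92, 1, 48),
    DomainHit(3, "Myb_DNA-binding", 58.5, 1.5e-18, 98, 142, 1, 47),
]


def linker_lengths(hits):
    """Residues lying between consecutive hits, ordered by start position."""
    ordered = sorted(hits, key=lambda h: h.start)
    return [b.start - a.end - 1 for a, b in zip(ordered, ordered[1:])]


best = min(HITS, key=lambda h: h.evalue)  # smallest E-value = strongest hit

print([h.length for h in HITS])  # [39, 47, 45]
print(linker_lengths(HITS))      # [5, 5]
print(best.number)               # 2
```

Hit 1 starts at HMM position 9 rather than 1, which is consistent with its shorter length and weaker score relative to hits 2 and 3.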

Sequence
Protein Sequence    Length: 1544 aa
MDETLRRAVQ CYKGKNWKRI AEFFADRTDV QCLHRWQKVL NPDLVKGPWT KEEDERIIQL  60
VNQYGAKKWS VIAQNLPGRI GKQCRERWHN HLNPNIKRDA WTEEEDLALI RAHFLYGNKW  120
AEIAKCLPGR TDNSIKNHWN STMKKRVDSG VFNADDPISV ALAATHGHPD ATLVGNPVQM  180
GVAAVSSSSS LGVESSLSSR IQKPVQVLSQ NAGVVGAEAH HPTTSLSMVI GGSGGMQSSS  240
LNRFSAQALC PNQVQKTSCS VQTSATDGVA FGGNCCPGKS DAAVSAGDDS EDAASVDGGL  300
KKPNNVRKEL LPNKVISRPP RPVPVLRPGV KINNDHLSTA TPAPPPPPVP PPPPPPPPVP  360
PPPVPHVSSS ALLPPIPSCC GSAAPASSIA LSAVQIPQAP VPTAMSLTGY LHNGAGTISG  420
MGLPSIPMAI PLPKSLPFSA SSGYPTPSGG PSGSGPLQMS ATIPSMGHSA VPSQSTCFPS  480
GMAPSSIQGP YHTVMTLHPT QGVAQGQHQI MGSHHHHHHH HHHHVVVGPF ASPAGQQGIR  540
ALATSTGTSS DGLGMVAMSM KSGTEFPPRM ASGGQSSMSL GMAHSQGIDS SLGYGMSRIG  600
QLVSVPQPQR AQDNFENGHQ AMTMPAEVAV VTSTGAEGPV SRLDVSVSSA MVPVEDQGSG  660
EPSSTGGQND NHEACTDELL RAASEVVTEY KYMNILHSMP DGGCAELGDS NEDVLFYEPP  720
KLNPMDLPFL EYDLSSPTDV SRKAFSPLGV SQMIMPAMCS PLLYGAFSPF AGQSPQSKLR  780
SAAQSFCSTP SILRKRRRKL DVTGGNGEGQ EGNLLLLQYG ESKKGVKAED KQGTADIVEQ  840
TREKEGEGTC GASSVPGQSE SPTTPDASSV SGDLEESTGK PAASVAPTVR PVMVSPSYNY  900
HNRDAMLQPC SASDSPSLLG LKGQVKGKDG SFQYRQITCR TPSKGSSGTG GSGGGAAGGS  960
TGWSNGNGNA TVKPGGEGSG QEGSGDATNG GGNSEGGARN AGGAVSKAMC MAGALKDCGR  1020
QERNVKRPRS FGSEGWAKAK NAGEDFQVRK KPDLGVLKNL PSAALSPKVA VLVEQRVNSQ  1080
TVLPEVLLRG TTAALQTDPP AGTGCAGASG GILTGPSAGV PCLSTVGAGP RVSKVVSPAT  1140
VSSTAMQRVG GTGVADGNSS ETEGCRMPGN EWVYCAVTDA PITCSPTGNP PDIVASGTCL  1200
GSAAGNENNY TGSPSGIWKP WPSSDAAKDC ATMKEGGANQ LIASGAIGRT SFSEYPNIGL  1260
LLEEGNDALG IMRHLNEHAS SAYSEAEDVL ARTARGDPSD RSVVEDGSPG SFKENQRGRS  1320
SSGFSSVSLN DCSSSYAGWL SPFGGLFSPD VYDIAITPLK HSSAADDSSN LLQPQPDPFS  1380
PSISWAAFAT TAFAMTMCSG LWASRNLGNL METALSPIDV LCRHSGWLLV KKEQRESESR  1440
KTAITELRKM AHKGGRKNGV VGEKDGGWRE AKEAERGRRR AEEEERRGGE EGEEEEAVRA  1500
VGVLHSKSGS GKSVFCSEDC HCKHEGVALW DMELDLNCYR CQSV
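
The sequence is printed in blocks of ten residues with a running position counter at the end of each line, so it needs cleaning before coordinates from the domain table can be applied. The sketch below is illustrative only: it uses just the first three lines of the sequence (residues 1 to 180), and `clean_sequence` and `region` are hypothetical helper names.

```python
import re

# First three lines of the protein sequence, copied as displayed
# (10-residue blocks followed by a position counter).
RAW = """\
MDETLRRAVQ CYKGKNWKRI AEFFADRTDV QCLHRWQKVL NPDLVKGPWT KEEDERIIQL  60
VNQYGAKKWS VIAQNLPGRI GKQCRERWHN HLNPNIKRDA WTEEEDLALI RAHFLYGNKW  120
AEIAKCLPGR TDNSIKNHWN STMKKRVDSG VFNADDPISV ALAATHGHPD ATLVGNPVQM  180
"""


def clean_sequence(raw: str) -> str:
    """Drop position counters and whitespace, keeping residue letters only."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"coordinates {start}-{end} outside 1-{len(seq)}")
    return seq[start - 1:end]


seq = clean_sequence(RAW)
print(len(seq))              # 180
print(region(seq, 2, 40))    # DETLRRAVQCYKGKNWKRIAEFFADRTDVQCLHRWQKVL
print(region(seq, 98, 142))  # RDAWTEEEDLALIRAHFLYGNKWAEIAKCLPGRTDNSIKNHWNST
```

Both extracted segments equal the GBG74571.1 rows of the corresponding alignments once the gap characters (`-`) are removed, which confirms that the Start and End columns are 1-based and inclusive.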
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    793    800  LRKRRRKL
2    794    798  RKRRR
3    794    827  RKRRRKLDVTGGNGEGQEGNLLLLQYGESKKGVK
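
The three predicted signals overlap on the protein. As a rough sketch, assuming overlapping or directly adjacent predictions should be treated as one region (a choice made here for illustration, not something the record states), they can be collapsed with a standard interval merge. The `merge_intervals` helper is a hypothetical name.

```python
# (start, end, sequence) copied from the NLS table; coordinates are
# 1-based and inclusive.
NLS_HITS = [
    (793, 800, "LRKRRRKL"),
    (794, 798, "RKRRR"),
    (794, 827, "RKRRRKLDVTGGNGEGQEGNLLLLQYGESKKGVK"),
]


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extends (or lies inside) the previous region.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(iv) for iv in merged]


# Sanity check: each motif length must agree with its coordinates.
for start, end, motif in NLS_HITS:
    assert end - start + 1 == len(motif)

print(merge_intervals([(s, e) for s, e, _ in NLS_HITS]))  # [(793, 827)]
```

Under that assumption, all three predictions fall within a single region spanning residues 793 to 827.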
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G02320.2  2e-77    MYB family protein