PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG60823.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family MYB
Protein Properties Length: 2915aa    MW: 318162 Da    PI: 6.3935
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG60823.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding57.53.2e-1810131059148
                       TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
  Myb_DNA-binding    1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48  
                       +g WTteEd+ l ++v+ +G+++W++Ia++++ gR +kqc++rw+++l
       GBG60823.1 1013 KGQWTTEEDDTLRQLVQVHGPKNWSLIAARLP-GRVGKQCRERWHNHL 1059
                       799*****************************.*************97 PP

2Myb_DNA-binding59.76.5e-1910651108146
                       TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding    1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46  
                       + +WT+eE+++lv a+++lG++ W+ Ia+ ++ gRt++++k++w++
       GBG60823.1 1065 KEAWTQEEELKLVAAHNKLGNK-WAEIAKLLP-GRTDNSIKNHWNS 1108
                       579*******************.*********.***********96 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2915 aa     Download sequence    
MQYVPSVPAG AIVNVHLTVM VNAPQSSWPA TGMPSVVHAS ETLWRAICVM GVTNSRGGNG  60
HCITKLALMS MEQDTGALPR RSARLAGRAR SVIPLRRRTL RLSSRTTPAA SSTALTVPVT  120
TGVLPACPVQ RAGEPLVDYL QWVQAFTDDV ATAKAQEEAE AERQRLAKEA APQAQQTAED  180
DAAARDQRNA AIMESLIHNE NQWTTILEGM XFVPSEEQAD PTPAEAERTH LANVMLTMMR  240
AIMWNNTMLA TRPDAATKSY KMPHFDISKF DDHNKTDALA WWQRFLTEAS CRTVQDDYMM  300
KALYLQLIGG TQAWMNHLAA THTCTIAELH THITWKEFEQ LWFTRFKVRN VVKAAMKEVY  360
TCSQGSMPTR DWTTKWHKIV TMPGFDLTFP NQRSELFSRS CAGLRSALGN EYDYTSFQAI  420
LDRANLVIQT DDKAANEKQS QPHYVANQGY QRPTHNNAVI YEGTKDLHAA AASSSDGGIV  480
AALPPKLYFA PLACEPVSFD VLDTKFNMIL VMSWLRSADH PVNFHDWTVH IRDRNGVLVP  540
CTVATPRTSI ACHVVSVARI RDAIARNDVE EMGLVFLHAL PSPDGPAASP PDPHISHLLD  600
EYTDVFEAPT GTVPDRPIRH GITLEAGAVP RRGCIYHMSE EELEVLRAQL DDLLDKGWIR  660
PSCSPYGTPI LFVWKKNKDL RLCIDYRKLN AQTVKNAGPL PRIDDLLERL GGATYFSKLD  720
LKSGYHQIEI QPQDRYKTAF KTRYGHFEWV VMPFGLTNAP ATFQAAMTTE FRDLLDRSVL  780
IYLDDILVYG RTLDEHITHL RAVLNCLRLA KYKANRDKCE FAKQELEYLG HYVTPKGIRP  840
LADKIQAIVD WSEPRCLMHR GQFAQDVRDP GNEGVLFENM SILDEIESTN DVMMKGVDDD  900
GGDFCLSEVL GAADCPSATE LVGPASWKGT SEEVGTVDGQ QAGSMDVDDA CLEDGVGETG  960
EDRHTHPSGG EAGGGSSEDD GLGKWDDDGS SEKKAGAQRR TEVGVITSAG VVKGQWTTEE  1020
DDTLRQLVQV HGPKNWSLIA ARLPGRVGKQ CRERWHNHLK PNIRKEAWTQ EEELKLVAAH  1080
NKLGNKWAEI AKLLPGRTDN SIKNHWNSTT RRKGIVCILL DCSTPVPTLR TPLYSVNTGT  1140
SQVPHQFRLM SMEQDTGALP RRSARIAVRA RPAVPPRPKR FSRRMTPAAS STTLTVQDTT  1200
GDLPACPVRE ASEPLADYRG RLQAFTDAVA AAEAQQAAAE AERQRLANEA AAEAQRTAET  1260
NEAARNRRNA ASTESLIASE YQWTTILQGM IFVPTETQAE RTQAEAKGSN LATVMLNAKA  1320
SAPPGCTTDA TKQINERIDH VVTIIGDIGV FNGPDTISST VAAIKTDITK LQTRPDAATK  1380
TFKMPHFDIC KFDDYNKSDA LTWWQRFLTE ASCRTVPAND MLKALYLQLI GGAQAWMNHL  1440
AATHKCTIAE LHMHITWKEF EQLWFTRFMV RNVVKAAMNE VYTCSQGNMP TRDWTTKWQK  1500
IVTTPGFDLT FPNQRSEFFS RSCAGLRSTL GNEYDYTTFQ AILDRANLVI QTDDKAANER  1560
QSQPHYVAKQ AYQRPTHNNA VISEETDDLH AAAASSSDRG NCSSAPAEAS QESSEEQGDT  1620
RDSIDGNWAA AMDXLQDNQG FYFAPLACEQ VSFDILDTKF DMILGMSWLR SADHPVNFHD  1680
RTVHIRDRNG VLVPCTIATP HTSIACYVVS VARIRDAIAR NDVEEMGLVF LHALPSPDGP  1740
AASPPDPRIS HLLDEYRDVF EAPTGTVSDR PIRHGITLEA GAAPPCGCIY CMSEEELEVL  1800
RAQLDDLLDK GWIRPSCSPY GAPVLFVRKK NKDLRLCIDY RKLNAQTVKN AGPLPRIDDL  1860
LEQLGGATYF SKLDLKSGYH LIEIQPQDRY KTAFKTRYGH FEWVVMPFGL TNAPATFQAA  1920
MTTEFRDLLD RTVLIYLDDI LVYTKYKANL DKCEFAKQEL EYLGHFVTPK GISPLADKIQ  1980
AIVDWPEPRC STDVRSFMGL AGYYQRFVES YSKVAAPLSR LQSPKDEGMR GSGRGARSRP  2040
DILQKYIMQL KAQNRERHQQ AELRRLSQAQ LAQHQDLHSS NPSDLQLREK TDAMLANLRK  2100
QQGPWGPSFV DGSLPSRSTM MFGGMAGQMS LFQDLMMASV QRNCEKGNFM QPNGRGVPGA  2160
TKEALHQQHQ LEPVVSSAMN SQLPTLGMEQ RPCGTSFSDI GRHSGIGHAV GGLGGVTRPS  2220
AEQAAVSAPT SVDFATSLCL GAGGPALGRE VNVSGMATAG LANAGAMTDV SGVSVMDLQQ  2280
QEQQKQQQQE NLNEQQLRSL QQHMSVCGQG SALAVQTDTS LPMLGGSTLA SILLSKGVSS  2340
IRPMKCAEET SADPGMLAGQ PSMTNGVSND RLWDLDSVRG EGYPCGTRPL PQGKLGVEPL  2400
GSNGLISGSA SDLNRQVNQD LVSKARPQQV IDSMGSASER QGCQQLSKQS AGGSLALPLQ  2460
QFPMPVCGSL RKRPRQIGPN GYGSSGCTLP GGDVSSAMRI DTDDAAALFG SSSTLGTSRA  2520
TVSSISSGGT TSAGSGICSL TQSGGRAARP FEAQATPVQM PGLGLPCTRV PESDQCLSPT  2580
VPGEPVNAIS KSDGNCPAAV SPPSGCPPSS NVCRNRSIIK RCKQETRSIP AHGSLEGLLQ  2640
ALGRNSPPPL ESEDEDSDYG IANGGQSEQE RNIIMESSTG AANPSTLAPL VPSKEEMTNL  2700
ESEVAAEDIL MKLLLASSSG RGLNESGAQK EMGRWNTGLP NMPIPVPMPL PAGSTGKPAD  2760
VDTTIHRPMN LIVPENPLLT CVRAGIGMRM PTSLGSRSTA APSANASSLA PLLSSPRDSG  2820
NTGTGTGYWD PCQLHLVQTF MSPPPMFSSG LVLGGSMGGN LPNNTVSNAI GSSGFDTFSS  2880
LLRMMAEDTL PSLYDPSANP PPSNTDPVAC GWRAN
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
124702476LRKRPRQ
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G27785.12e-43MYB family protein