PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG016203t2
Common NameTCM_016203
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family MYB
Protein Properties Length: 442aa    MW: 50034.6 Da    PI: 8.3703
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG016203t2genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding41.72.6e-13745948
                      HHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
   Myb_DNA-binding  9 dellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48
                      d ll +avk+   ++Wk+Ia+ ++ gRt+ qc +rwqk+l
  Thecc1EG016203t2  7 DNLLTEAVKKCKARNWKKIAEFLP-GRTDIQCLHRWQKVL 45
                      889*********************.************986 PP

2Myb_DNA-binding568.8e-185197148
                      TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
   Myb_DNA-binding  1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48
                      +g+WT+eEd+ + ++v+++G + W+ Ia+ +  gR +kqc++rw+++l
  Thecc1EG016203t2 51 KGPWTKEEDDCITKLVEKYGCRKWSVIAKFLR-GRIGKQCRERWYNHL 97
                      79******************************.*************97 PP

3Myb_DNA-binding52.11.6e-16103145145
                       TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
   Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45 
                       +++WT+eE+ +l+ +++ +G++ W+t a+ ++ gRt++ +k++w+
  Thecc1EG016203t2 103 KDSWTEEEEAILAYYHQIYGNK-WTTLAKLLP-GRTDNAIKNHWN 145
                       679*******************.*********.***********8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5129414.504145IPR017930Myb domain
SMARTSM007173.5E-6647IPR001005SANT/Myb domain
SuperFamilySSF466891.22E-15659IPR009057Homeodomain-like
CDDcd001675.52E-10745No hitNo description
Gene3DG3DSA:1.10.10.603.6E-18757IPR009057Homeodomain-like
PfamPF002496.7E-11745IPR001005SANT/Myb domain
PROSITE profilePS5129426.45146101IPR017930Myb domain
SuperFamilySSF466898.24E-3148144IPR009057Homeodomain-like
SMARTSM007179.6E-165099IPR001005SANT/Myb domain
PfamPF002491.1E-165197IPR001005SANT/Myb domain
CDDcd001671.13E-145397No hitNo description
Gene3DG3DSA:1.10.10.607.2E-2658104IPR009057Homeodomain-like
SMARTSM007173.3E-15102150IPR001005SANT/Myb domain
PROSITE profilePS5129419.016102152IPR017930Myb domain
PfamPF002491.3E-13103145IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.607.9E-22105152IPR009057Homeodomain-like
CDDcd001678.17E-11105148No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 442 aa     Download sequence    Send to blast
MIMVGQDNLL TEAVKKCKAR NWKKIAEFLP GRTDIQCLHR WQKVLNPGIF KGPWTKEEDD  60
CITKLVEKYG CRKWSVIAKF LRGRIGKQCR ERWYNHLDPT IRKDSWTEEE EAILAYYHQI  120
YGNKWTTLAK LLPGRTDNAI KNHWNCTLKK KLGFYSPHRY AVDICNDGSS DFSDQETTPK  180
CLKVKEERQG LDETVSVYPN IDVDYSVDRC YLDLVLGIAN QTETKPEADS GKFEKCWSAG  240
VPNEQITPLK RVHFDDKVNS TTEDSLIRSV RGNAKHAKIH EPQSASCRVA SEDTQALLPS  300
TSVGSPLSSL TFKFGEDNGQ VDKESTNQRM HTAYASGCLL HNEPSQPKDS TSAIIPIVDN  360
QDMHIKSSFC YSAPPKLVGS RSLNSGSPES ILRISAMTFK NPSIIRKRSY KKAWNDNFSD  420
AACSPARTFT CFHWEEVNGT Y*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1h88_C1e-63615213159MYB PROTO-ONCOGENE PROTEIN
1h89_C1e-63615213159MYB PROTO-ONCOGENE PROTEIN
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007040158.10.0PREDICTED: uncharacterized protein LOC18606476 isoform X2
TrEMBLA0A061G4B00.0A0A061G4B0_THECC; Myb domain protein 3r-5, putative isoform 2
STRINGEOY246590.0(Theobroma cacao)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G02320.26e-73myb domain protein 3r-5
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]