PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG016203t1
Common NameTCM_016203
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family MYB
Protein Properties Length: 497aa    MW: 56215.1 Da    PI: 7.5899
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG016203t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding57.33.6e-1854100148
                       TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
   Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48 
                       +g W++eEd ll +avk+   ++Wk+Ia+ ++ gRt+ qc +rwqk+l
  Thecc1EG016203t1  54 KGGWSEEEDNLLTEAVKKCKARNWKKIAEFLP-GRTDIQCLHRWQKVL 100
                       688*****************************.************986 PP

2Myb_DNA-binding55.81e-17106152148
                       TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
   Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48 
                       +g+WT+eEd+ + ++v+++G + W+ Ia+ +  gR +kqc++rw+++l
  Thecc1EG016203t1 106 KGPWTKEEDDCITKLVEKYGCRKWSVIAKFLR-GRIGKQCRERWYNHL 152
                       79******************************.*************97 PP

3Myb_DNA-binding51.81.8e-16158200145
                       TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
   Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45 
                       +++WT+eE+ +l+ +++ +G++ W+t a+ ++ gRt++ +k++w+
  Thecc1EG016203t1 158 KDSWTEEEEAILAYYHQIYGNK-WTTLAKLLP-GRTDNAIKNHWN 200
                       679*******************.*********.***********8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5129419.19949100IPR017930Myb domain
SuperFamilySSF466893.88E-1751107IPR009057Homeodomain-like
SMARTSM007173.8E-1753102IPR001005SANT/Myb domain
PfamPF002493.7E-1654100IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.601.0E-2256116IPR009057Homeodomain-like
CDDcd001672.76E-1457100No hitNo description
PROSITE profilePS5129426.451101156IPR017930Myb domain
SuperFamilySSF466899.9E-31103199IPR009057Homeodomain-like
SMARTSM007179.6E-16105154IPR001005SANT/Myb domain
PfamPF002491.2E-16106152IPR001005SANT/Myb domain
CDDcd001671.90E-14108152No hitNo description
Gene3DG3DSA:1.10.10.601.1E-25117159IPR009057Homeodomain-like
PROSITE profilePS5129419.016157207IPR017930Myb domain
SMARTSM007173.3E-15157205IPR001005SANT/Myb domain
PfamPF002491.6E-13158200IPR001005SANT/Myb domain
CDDcd001671.20E-10160203No hitNo description
Gene3DG3DSA:1.10.10.609.3E-22160207IPR009057Homeodomain-like
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 497 aa     Download sequence    Send to blast
MVEVKKEKDE YYEDLAEEDI SASRSSFPDS SCDTAMRSAS LQGMVTGPTR YSRKGGWSEE  60
EDNLLTEAVK KCKARNWKKI AEFLPGRTDI QCLHRWQKVL NPGIFKGPWT KEEDDCITKL  120
VEKYGCRKWS VIAKFLRGRI GKQCRERWYN HLDPTIRKDS WTEEEEAILA YYHQIYGNKW  180
TTLAKLLPGR TDNAIKNHWN CTLKKKLGFY SPHRYAVDIC NDGSSDFSDQ ETTPKCLKVK  240
EERQGLDETV SVYPNIDVDY SVDRCYLDLV LGIANQTETK PEADSGKFEK CWSAGVPNEQ  300
ITPLKRVHFD DKVNSTTEDS LIRSVRGNAK HAKIHEPQSA SCRVASEDTQ ALLPSTSVGS  360
PLSSLTFKFG EDNGQVDKES TNQRMHTAYA SGCLLHNEPS QPKDSTSAII PIVDNQDMHI  420
KSSFCYSAPP KLVGSRSLNS GSPESILRIS AMTFKNPSII RKRSYKKAWN DNFSDAACSP  480
ARTFTCFHWE EVNGTY*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1h88_C7e-66542076159MYB PROTO-ONCOGENE PROTEIN
1h89_C7e-66542076159MYB PROTO-ONCOGENE PROTEIN
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007040158.10.0PREDICTED: uncharacterized protein LOC18606476 isoform X2
TrEMBLA0A061GCB50.0A0A061GCB5_THECC; Myb domain protein 3r-5, putative isoform 1
STRINGEOY246590.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM2070356
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G02320.21e-78myb domain protein 3r-5
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]