PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG012571t2
Common Name TCM_012571
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family MYB
Protein Properties Length: 701aa    MW: 80691.1 Da    PI: 9.6895
Description MYB family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG012571t2  genome  CGD     View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  23.3   1.5e-07  220    264  2          46
                       SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
   Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                       ++WT+eEd+ l+  v++ G  +W  I   +g +Rt+ qc  r+q 
  Thecc1EG012571t2 220 NPWTAEEDKNLLFIVQEKGISNWFDIVVSLGSNRTPFQCLARYQR 264
                       58*****************************************96 PP

2    Myb_DNA-binding  43.1   9.5e-14  273    316  2          46
                       SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
   Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                         WT+eEd++l  av+ +G  +W ++a+t+  gRt+ qc +rw k
  Thecc1EG012571t2 273 REWTEEEDDQLRIAVEVFGECDWQSVASTLK-GRTGTQCSNRWIK 316
                       68****************************9.***********87 PP

3    Myb_DNA-binding  53     8e-17    326    369  2          46
                       SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
   Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                       grWT +Ed++l+ av ++G+++W++Ia+ ++ gRt  qc++rw +
  Thecc1EG012571t2 326 GRWTHDEDKRLKVAVMLFGPKNWRKIAEVIP-GRTQVQCRERWVN 369
                       8******************************.***********87 PP

4    Myb_DNA-binding  48.1   2.6e-15  378    421  2          47
                       SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
   Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                       grWT+eEd++l  a++++G   W+++a++m+  Rt++qc  rw ++
  Thecc1EG012571t2 378 GRWTKEEDLRLEAAIEEHGYY-WSKVAACMP-SRTDNQCWRRWKTL 421
                       89*****************99.*********.***********976 PP

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50090           6.586     117    213  IPR017877    Myb-like domain
SMART            SM00717           4.8E-5    118    215  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  7.3E-7    118    137  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  7.3E-7    185    215  IPR009057    Homeodomain-like
SuperFamily      SSF46689          3.33E-11  195    245  IPR009057    Homeodomain-like
PROSITE profile  PS50090           8.653     214    266  IPR017877    Myb-like domain
SMART            SM00717           3.1E-6    218    268  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  3.4E-13   221    275  IPR009057    Homeodomain-like
CDD              cd00167           6.20E-5   221    266  No hit       No description
Pfam             PF13921           3.6E-7    222    268  No hit       No description
SuperFamily      SSF46689          4.67E-19  246    321  IPR009057    Homeodomain-like
PROSITE profile  PS51294           14.831    267    318  IPR017930    Myb domain
SMART            SM00717           4.8E-11   271    320  IPR001005    SANT/Myb domain
Pfam             PF00249           1.7E-11   273    316  IPR001005    SANT/Myb domain
CDD              cd00167           3.01E-9   274    318  No hit       No description
Gene3D           G3DSA:1.10.10.60  9.8E-16   276    321  IPR009057    Homeodomain-like
PROSITE profile  PS51294           21.346    319    375  IPR017930    Myb domain
SuperFamily      SSF46689          1.7E-26   323    418  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  5.0E-18   323    371  IPR009057    Homeodomain-like
SMART            SM00717           5.2E-15   324    373  IPR001005    SANT/Myb domain
Pfam             PF00249           1.2E-15   326    369  IPR001005    SANT/Myb domain
CDD              cd00167           3.17E-12  327    371  No hit       No description
Gene3D           G3DSA:1.10.10.60  2.5E-17   372    422  IPR009057    Homeodomain-like
SMART            SM00717           1.0E-12   376    424  IPR001005    SANT/Myb domain
Pfam             PF00249           3.8E-13   378    421  IPR001005    SANT/Myb domain
PROSITE profile  PS51294           16.644    378    426  IPR017930    Myb domain
CDD              cd00167           1.11E-10  379    421  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 701 aa
MTDPPLRTPP SRRLLWRIPT ARMIWKFSGV YGVVLRFQWT FVSLCLFSLP ALFLQFLLMT  60
MLKMISRLFA LSRGGFRLIL AMLLQVNDKN VSADYGPPEN SSVTNYRMAL TKFPLALQRK  120
KWSREERENL VKGIRQQFQE SALQVSVDWF SSADGSSGDG SNLDDIIATV KDLEITPERI  180
REFLPKVNWD QLASMYVKGR SGAECETRWL NHEDPLINCN PWTAEEDKNL LFIVQEKGIS  240
NWFDIVVSLG SNRTPFQCLA RYQRSLNACI LKREWTEEED DQLRIAVEVF GECDWQSVAS  300
TLKGRTGTQC SNRWIKSLHP TRQRVGRWTH DEDKRLKVAV MLFGPKNWRK IAEVIPGRTQ  360
VQCRERWVNS LDPALNLGRW TKEEDLRLEA AIEEHGYYWS KVAACMPSRT DNQCWRRWKT  420
LHPKAVPLLQ EARRIRKATL VSNFVDRESE RPALGPNDFY IPLQLTNSTS EPENTNLPSE  480
GKRKERRRII SAEDFENLPS SKKVEKRGNS SLRQHSRSRK RNELSGAKDD ATLASFLQDK  540
LKKNIPSYAD GDEMKLAGFL RNKSKKRRHQ IAENAHLSIM KGPEQRDKTN QIQFGLQRCE  600
AKTNCDGVIP ENSMFRSSLK QTIMSSDMNI VGDDIVNDTV APHEVVREPD RIDQEGNCEA  660
DGITLVQLRK RLKKRGPASS CMRRESELPS KELDEPRHND *
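
The Start and End columns of the Signature Domain table appear to be 1-based, inclusive positions in this protein sequence. The Python sketch below illustrates that reading under stated assumptions: it parses the numbered sequence block, slices out each domain, and lets the result be compared with the alignment rows above. The helper names `parse_sequence_block` and `slice_domain` are hypothetical and not part of any PlantTFDB tool, and only the first eight lines of the sequence (residues 1 to 480) are included, which is enough to cover all four domains.

```python
# First eight lines of the Protein Sequence block (residues 1-480),
# copied as displayed, including spacing and running position counters.
SEQUENCE_BLOCK = """\
MTDPPLRTPP SRRLLWRIPT ARMIWKFSGV YGVVLRFQWT FVSLCLFSLP ALFLQFLLMT  60
MLKMISRLFA LSRGGFRLIL AMLLQVNDKN VSADYGPPEN SSVTNYRMAL TKFPLALQRK  120
KWSREERENL VKGIRQQFQE SALQVSVDWF SSADGSSGDG SNLDDIIATV KDLEITPERI  180
REFLPKVNWD QLASMYVKGR SGAECETRWL NHEDPLINCN PWTAEEDKNL LFIVQEKGIS  240
NWFDIVVSLG SNRTPFQCLA RYQRSLNACI LKREWTEEED DQLRIAVEVF GECDWQSVAS  300
TLKGRTGTQC SNRWIKSLHP TRQRVGRWTH DEDKRLKVAV MLFGPKNWRK IAEVIPGRTQ  360
VQCRERWVNS LDPALNLGRW TKEEDLRLEA AIEEHGYYWS KVAACMPSRT DNQCWRRWKT  420
LHPKAVPLLQ EARRIRKATL VSNFVDRESE RPALGPNDFY IPLQLTNSTS EPENTNLPSE  480
"""

# (No., Start, End) taken from the Signature Domain table.
# Assumption: coordinates are 1-based and inclusive.
DOMAINS = [(1, 220, 264), (2, 273, 316), (3, 326, 369), (4, 378, 421)]


def parse_sequence_block(block: str) -> str:
    """Join the 10-residue groups, dropping position counters and any '*'."""
    residues = []
    for line in block.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # running position counter at the end of a line
            residues.append(token.rstrip("*"))  # '*' marks the stop codon
    return "".join(residues)


def slice_domain(seq: str, start: int, end: int) -> str:
    """Return seq[start..end] using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


SEQ = parse_sequence_block(SEQUENCE_BLOCK)

for number, start, end in DOMAINS:
    print(number, start, end, slice_domain(SEQ, start, end))
```

Each printed fragment can be checked against the query row of the corresponding alignment once the gap characters (`-`) in that row are removed.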
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
1h88_C  8e-29   188          412        6          155      MYB PROTO-ONCOGENE PROTEIN
1h89_C  8e-29   188          412        6          155      MYB PROTO-ONCOGENE PROTEIN
Nucleic Localization Signal
NLS
No.  Start  End  Sequence
1    668    674  RKRLKKR
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_017973755.1  0.0      PREDICTED: myb-like protein L
TrEMBL  A0A061FVF2      0.0      A0A061FVF2_THECC; Myb domain protein 4r1, putative isoform 2
STRING  EOY21190        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G18100.2  1e-155   myb domain protein 4r1
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]