PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG042474t1
Common NameTCM_042474
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family MYB
Protein Properties Length: 237aa    MW: 26196.2 Da    PI: 7.185
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG042474t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding57.63e-18954147
                      TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
   Myb_DNA-binding  1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47
                      +g+W+++Ed  l+++v+q+G+++W+ I++ ++ gR++k+c++rw + 
  Thecc1EG042474t1  9 KGSWSPQEDANLIRLVEQHGPRNWSMISSGIP-GRSGKSCRLRWCNQ 54
                      799*****************************.***********985 PP

2Myb_DNA-binding52.51.2e-1663105347
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
   Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                       ++T+ Ed  +++a++ +G++ W+tIar ++ gRt++ +k++w++ 
  Thecc1EG042474t1  63 PFTPAEDAVIIRAHAAHGNK-WATIARQLP-GRTDNAVKNHWNST 105
                       89******************.*********.***********986 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5129419.573455IPR017930Myb domain
SuperFamilySSF466892.58E-307102IPR009057Homeodomain-like
SMARTSM007171.2E-15857IPR001005SANT/Myb domain
PfamPF002491.5E-17954IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.601.2E-241062IPR009057Homeodomain-like
CDDcd001672.72E-141153No hitNo description
PROSITE profilePS5129425.68956110IPR017930Myb domain
SMARTSM007174.1E-1460108IPR001005SANT/Myb domain
PfamPF002495.7E-1463105IPR001005SANT/Myb domain
CDDcd001679.48E-1163106No hitNo description
Gene3DG3DSA:1.10.10.606.3E-2463109IPR009057Homeodomain-like
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 237 aa     Download sequence    Send to blast
MIKGGDRVKG SWSPQEDANL IRLVEQHGPR NWSMISSGIP GRSGKSCRLR WCNQLSPDVQ  60
HRPFTPAEDA VIIRAHAAHG NKWATIARQL PGRTDNAVKN HWNSTLRRKR AAELSSGSSE  120
SNNSAVKRWS SQDASESDSG NKRQCLRVEV HENVEFVGPK TLLTLSPPGE SVVSGHMEEK  180
VEDEEEEEVV KRDEEGGGRG GGGGGEEEKR RVEMKETCLL TIMQRMIKEE FVAFNG*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1gv2_A8e-3781093104MYB PROTO-ONCOGENE PROTEIN
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1196204GGRGGGGGG
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007008918.11e-173PREDICTED: transcriptional activator Myb isoform X2
TrEMBLA0A061FLG61e-172A0A061FLG6_THECC; Myb domain protein 73
STRINGEOY177281e-173(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM67728135
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G50060.13e-54myb domain protein 77
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]