PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG010868t1
Common NameTCM_010868
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 232aa    MW: 25448.9 Da    PI: 8.4973
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG010868t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox61.81e-1972126256
                       T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       rk+ ++tkeq + Lee F++n++++ +++e LA++l+L  rqV vWFqNrRa+ k
  Thecc1EG010868t1  72 RKKLRLTKEQSRLLEESFRQNHTLNPKQKEALAMQLKLRPRQVEVWFQNRRARSK 126
                       78889************************************************98 PP

2HD-ZIP_I/II118.53.6e-3872160189
       HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLre 89 
                       +kk+rl+keq++lLEesF+++++L+p++K++la +L+l+prqv+vWFqnrRAR k+kq+E+++e+Lkr++ +l+e+n+rL++eveeLr+
  Thecc1EG010868t1  72 RKKLRLTKEQSRLLEESFRQNHTLNPKQKEALAMQLKLRPRQVEVWFQNRRARSKLKQTEMECEYLKRWFGSLTEQNRRLQREVEELRA 160
                       69*************************************************************************************94 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.602.1E-1858126IPR009057Homeodomain-like
SuperFamilySSF466891.97E-1958129IPR009057Homeodomain-like
PROSITE profilePS5007117.4168128IPR001356Homeobox domain
SMARTSM003898.5E-1870132IPR001356Homeobox domain
CDDcd000863.48E-1771129No hitNo description
PfamPF000464.8E-1772126IPR001356Homeobox domain
PROSITE patternPS000270103126IPR017970Homeobox, conserved site
PfamPF021837.4E-10128162IPR003106Leucine zipper, homeobox-associated
SMARTSM003404.4E-22128171IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 232 aa     Download sequence    Send to blast
MAVLPTGSSN LELTISVPGF SSSPSLPSSG DQGGCTVRDL DINQVPSGGA EDEWITASME  60
DEEESCNGAP PRKKLRLTKE QSRLLEESFR QNHTLNPKQK EALAMQLKLR PRQVEVWFQN  120
RRARSKLKQT EMECEYLKRW FGSLTEQNRR LQREVEELRA MKVGPPTVIS PHSCEPLPAS  180
TLTMCPRCER VTTTALDKGP TKMTAATATA TTLSSKVGTS ALQSRPSSAA C*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1120128RRARSKLKQ
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription repressor that binds to the DNA sequence 5'-CAAT[GC]ATTG-3'. {ECO:0000269|PubMed:10732669}.
UniProtProbable transcription repressor that binds to the DNA sequence 5'-CAAT[GC]ATTG-3'. {ECO:0000269|PubMed:10732669}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007045132.11e-169PREDICTED: homeobox-leucine zipper protein HOX3
SwissprotQ0JKX13e-87HOX3_ORYSJ; Homeobox-leucine zipper protein HOX3
SwissprotQ9XH383e-87HOX3_ORYSI; Homeobox-leucine zipper protein HOX3
TrEMBLA0A061EF991e-168A0A061EF99_THECC; Homeobox-leucine zipper protein HOX3
STRINGEOY009641e-169(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM48902747
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G01430.12e-75homeobox-leucine zipper protein 17
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]