PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG024940t1
Common NameTCM_024940
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 360aa    MW: 39238.8 Da    PI: 8.369
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG024940t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox54.91.4e-17177231256
                       T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       rk+ +++keq   Lee F+++++++ +++  LAk+l+L  rqV vWFqNrRa+ k
  Thecc1EG024940t1 177 RKKLRLSKEQSAFLEESFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTK 231
                       778899***********************************************98 PP

2HD-ZIP_I/II127.84.6e-41177267192
       HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreelk 92 
                       +kk+rlskeq+++LEesF+e+++L+p++K +la++L+l+prqv+vWFqnrRARtk+kq+E+d+e+Lkr++++l+een+rL+ke +eLr +lk
  Thecc1EG024940t1 177 RKKLRLSKEQSAFLEESFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRCCETLTEENRRLQKELQELR-ALK 267
                       69*************************************************************************************9.555 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF046184.1E-870139IPR006712HD-ZIP protein, N-terminal
Gene3DG3DSA:1.10.10.607.4E-18159234IPR009057Homeodomain-like
SuperFamilySSF466892.91E-18168234IPR009057Homeodomain-like
PROSITE profilePS5007117.216173233IPR001356Homeobox domain
SMARTSM003897.3E-16175237IPR001356Homeobox domain
PfamPF000464.7E-15177231IPR001356Homeobox domain
CDDcd000861.55E-15177234No hitNo description
PROSITE patternPS000270208231IPR017970Homeobox, conserved site
PfamPF021837.4E-11233267IPR003106Leucine zipper, homeobox-associated
SMARTSM003404.5E-26233276IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 360 aa     Download sequence    Send to blast
MELALSLGDP SKPFSFLDKA PKLSSKDLGF CMGLGNGFRS QEKGDAFEGE SRGEATRDGH  60
DKRVSSDPPL QLDLLPFSPV PRSQPPSQLR FPWLTDNQTG SSEGQGRGLD VNRLPVVAVM  120
DEAEDGAAMS SPNSAVSSFQ MDFGIRNGSG RGKRDLEVEN ERASSRASDD DENGSTRKKL  180
RLSKEQSAFL EESFKEHNTL NPKQKLALAK QLNLRPRQVE VWFQNRRART KLKQTEVDCE  240
YLKRCCETLT EENRRLQKEL QELRALKTSQ PFYMQLPATT LTMCPSCERV ATTSTTANST  300
AAAATTTTNG SAAAAAAGSN SDAKTGVLPL TKTRGYPFSP LPTHVTQSQP QPQAHQAAS*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1175181TRKKLRL
2225233RRARTKLKQ
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor. {ECO:0000250}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017977138.10.0PREDICTED: homeobox-leucine zipper protein HOX11
SwissprotP466651e-106HAT14_ARATH; Homeobox-leucine zipper protein HAT14
TrEMBLA0A061F4R60.0A0A061F4R6_THECC; Homeobox from
STRINGEOY095240.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM49282852
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G06710.16e-87homeobox from Arabidopsis thaliana
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]