PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG000867t1
Common Name TCM_000867
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 285 aa    MW: 31418.4 Da    pI: 8.5649
Description HD-ZIP family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG000867t1  genome  CGD     View CDS
Signature Domain
No.  Domain       Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox     57.9   1.7e-18  140    194  2          56
                       T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       rk+ ++tk+q   Lee F+++++++ ++++ LA++l+L  rqV vWFqNrRa+ k
  Thecc1EG000867t1 140 RKKLRLTKDQSALLEESFKQHSTLNPKQKQALARQLSLRPRQVEVWFQNRRARTK 194
                       788899***********************************************98 PP

2    HD-ZIP_I/II  124.3  5.7e-40  140    229  1          91
       HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreel 91 
                       +kk+rl+k+q++lLEesF+++++L+p++K++lar+L l+prqv+vWFqnrRARtk+kq+E+d+e+Lk+++++l++en+rL+ke +eL+ +l
  Thecc1EG000867t1 140 RKKLRLTKDQSALLEESFKQHSTLNPKQKQALARQLSLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELK-AL 229
                       69*************************************************************************************9.55 PP

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Pfam             PF04618           1.1E-4    10     111  IPR006712    HD-ZIP protein, N-terminal
PROSITE profile  PS50071           17.184    136    196  IPR001356    Homeobox domain
SMART            SM00389           4.0E-15   138    200  IPR001356    Homeobox domain
SuperFamily      SSF46689          5.99E-18  138    204  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  3.3E-17   140    194  IPR009057    Homeodomain-like
CDD              cd00086           1.41E-16  140    197  No hit       No description
Pfam             PF00046           7.0E-16   140    194  IPR001356    Homeobox domain
PROSITE pattern  PS00027           0         171    194  IPR017970    Homeobox, conserved site
Pfam             PF02183           5.4E-10   196    230  IPR003106    Leucine zipper, homeobox-associated
SMART            SM00340           5.7E-26   196    239  IPR003106    Leucine zipper, homeobox-associated
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0009735  Biological Process  response to cytokinin
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 285 aa
MGLDDACNTG LVLGLGFSST LETPSKANNQ TPKKSSCLKF EPTAMAAASF EPSLTLGLSG  60
ESYQVVTASK KIDVNKGGYH HHEEPAAAGD LYRQASPHSA VSSFSSGRVK RERDLSSEEV  120
EVEKNSSRVS DEDEDGVNAR KKLRLTKDQS ALLEESFKQH STLNPKQKQA LARQLSLRPR  180
QVEVWFQNRR ARTKLKQTEV DCEFLKKCCE TLTDENRRLQ KELQELKALK LAQPFYMHMP  240
AATLTMCPSC ERIGGVGDGN SKSPFSMASK PHFYNPFTNP SAAC*
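
The domain coordinates listed under Signature Domain can be checked against the protein sequence directly. The following is a minimal Python sketch, not part of PlantTFDB: the helper names `clean_sequence` and `slice_1based` are hypothetical, and it assumes the domain Start/End values are 1-based and inclusive, which is what the alignment rows in this record suggest.

```python
import re

# Sequence block copied from this record: groups of 10 residues,
# running position counters at line ends, and a trailing '*' stop marker.
RAW = """
MGLDDACNTG LVLGLGFSST LETPSKANNQ TPKKSSCLKF EPTAMAAASF EPSLTLGLSG  60
ESYQVVTASK KIDVNKGGYH HHEEPAAAGD LYRQASPHSA VSSFSSGRVK RERDLSSEEV  120
EVEKNSSRVS DEDEDGVNAR KKLRLTKDQS ALLEESFKQH STLNPKQKQA LARQLSLRPR  180
QVEVWFQNRR ARTKLKQTEV DCEFLKKCCE TLTDENRRLQ KELQELKALK LAQPFYMHMP  240
AATLTMCPSC ERIGGVGDGN SKSPFSMASK PHFYNPFTNP SAAC*
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace, position counters, and the stop marker; keep letters only."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based and inclusive."""
    return seq[start - 1:end]


SEQ = clean_sequence(RAW)
HOMEOBOX = slice_1based(SEQ, 140, 194)     # Homeobox row: Start 140, End 194
HD_ZIP = slice_1based(SEQ, 140, 229)       # HD-ZIP_I/II row: Start 140, End 229

if __name__ == "__main__":
    print(len(SEQ))
    print(HOMEOBOX)
    print(HD_ZIP)
```

The cleaned sequence holds 284 residues, which suggests that the stated length of 285 counts the trailing `*`. The Homeobox slice reproduces the query row of the first alignment in this record.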
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    188    196  RRARTKLKQ
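
To see where the NLS peptide sits in the protein, the motif can be searched for in the sequence. This is a small Python sketch under stated assumptions: `find_motif` is a hypothetical helper, and it reports 1-based inclusive coordinates, which may not be the convention PlantTFDB uses for this table.

```python
# Sequence block copied from this record (position counters and '*' included).
RAW = """
MGLDDACNTG LVLGLGFSST LETPSKANNQ TPKKSSCLKF EPTAMAAASF EPSLTLGLSG  60
ESYQVVTASK KIDVNKGGYH HHEEPAAAGD LYRQASPHSA VSSFSSGRVK RERDLSSEEV  120
EVEKNSSRVS DEDEDGVNAR KKLRLTKDQS ALLEESFKQH STLNPKQKQA LARQLSLRPR  180
QVEVWFQNRR ARTKLKQTEV DCEFLKKCCE TLTDENRRLQ KELQELKALK LAQPFYMHMP  240
AATLTMCPSC ERIGGVGDGN SKSPFSMASK PHFYNPFTNP SAAC*
"""

# Keep only residue letters (drops spaces, digits, newlines, and '*').
SEQ = "".join(ch for ch in RAW if ch.isalpha())
NLS = "RRARTKLKQ"


def find_motif(seq: str, motif: str):
    """Return the 1-based inclusive (start, end) of the first match, or None."""
    i = seq.find(motif)          # 0-based index, -1 when absent
    if i < 0:
        return None
    return i + 1, i + len(motif)


NLS_SPAN = find_motif(SEQ, NLS)

if __name__ == "__main__":
    print(NLS_SPAN)
```

In 1-based coordinates the peptide spans residues 189 to 197, so the listed Start 188 and End 196 look consistent with 0-based, inclusive positions. That reading is an inference from the sequence in this record, not something the record states.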
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Binding Motif
Motif ID  Method  Source                   Motif file
MP00477   DAP     Transfer from AT4G37790  Download
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HM134793  0.0      HM134793.1 Gossypium hirsutum homeodomain-leucine zipper protein HD4 (HB4) mRNA, complete cds.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_017970499.1  0.0      PREDICTED: homeobox-leucine zipper protein HAT22
Swissprot  P46604          1e-113   HAT22_ARATH; Homeobox-leucine zipper protein HAT22
TrEMBL     A0A061DHW3      0.0      A0A061DHW3_THECC; Homeodomain-leucine zipper protein HD4
STRING     EOX91792        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM1671              28           86
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G37790.1  5e-99    HD-ZIP family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]
  2. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]
  3. Liu T, Longhurst AD, Talavera-Rauh F, Hokin SA, Barton MK
    The Arabidopsis transcription factor ABIG1 relays ABA signaled growth inhibition and drought induced senescence.
    Elife, 2017.
    [PMID:27697148]
  4. Song L, et al.
    A transcription factor hierarchy defines an environmental stress response network.
    Science, 2017.
    [PMID:27811239]