PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG030175t1
Common Name TCM_030175
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 721 aa    MW: 78919.6 Da    pI: 6.0831
Description HD-ZIP family protein
Gene Model
Gene Model ID | Type | Source | Coding Sequence
Thecc1EG030175t1 | genome | CGD | View CDS
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | Homeobox | 61.9 | 9.5e-20 | 57 | 112 | 1 | 56
                       TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       +++ +++t+ q++e+e++F+++++p+ ++r+eL+++lgL+  qVk+WFqN+R+++k
  Thecc1EG030175t1  57 KKRYHRHTQLQIQEMEAFFKECPHPDDKQRKELSRELGLEPLQVKFWFQNKRTQMK 112
                       688999***********************************************998 PP

2 | START | 223.9 | 5.1e-70 | 246 | 465 | 1 | 206
                       HHHHHHHHHHHHHHHHC-TT-EEEE...EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEE CS
             START   1 elaeeaaqelvkkalaeepgWvkss...esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaet 81 
                       ela +a++el+++a+++ep+Wv      + +n+de+l++f+++ +      ++ea+r+s+vv+m++++lve+l+d++ qW+  +     +a+t
  Thecc1EG030175t1 246 ELAVAAMEELIRMAQSGEPLWVPGEkstDVLNEDEYLRTFPRGIGpkplgLRSEASRESAVVIMNHVNLVEILMDVN-QWSSAFCgivsRAMT 337
                       57899****************97777779*************999********************************.******99999**** PP

                       EEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEE CS
             START  82 levissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwv 167
                       lev+s+g      galq+m+ae+q++splvp R+ +fvRy++q+++g+w++vdvS+d+ ++ p    + +++++pSg+li++++ng+skv+wv
  Thecc1EG030175t1 338 LEVLSTGvagnynGALQVMTAEFQVPSPLVPtRENYFVRYCKQHTDGTWAVVDVSLDNLRPSP----MSKCRRRPSGCLIQELPNGYSKVIWV 426
                       *************************************************************99....788999******************** PP

                       E-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
             START 168 ehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                       ehv++++r +h+++r+lv+sgla+gak+wvatl+rqce+
  Thecc1EG030175t1 427 EHVEVDDRAVHNIYRPLVNSGLAFGAKRWVATLDRQCER 465
                       *************************************97 PP
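
The Start and End columns are 1-based positions on the query protein, so the end coordinate of any alignment row can be recovered from its start and the number of non-gap residues on the query line. A minimal Python sketch using the first START alignment row above; the helper name `row_end` is illustrative and is not part of PlantTFDB or of any alignment tool:

```python
def row_end(start: int, aligned_query: str, gap_chars: str = "-.") -> int:
    """Return the 1-based end coordinate of an aligned query row.

    start         -- 1-based position of the first residue in the row
    aligned_query -- query line of the alignment, possibly containing gaps
    gap_chars     -- characters treated as gaps (assumed to be '-' and '.')
    """
    residues = sum(1 for ch in aligned_query if ch not in gap_chars)
    return start + residues - 1


# Query line of the first START alignment block, which starts at residue 246.
query_row = (
    "ELAVAAMEELIRMAQSGEPLWVPGEKSTDVLNEDEYLRTFPRGIGPKPLGLRSEASRESAVVIMN"
    "HVNLVEILMDVN-QWSSAFCGIVSRAMT"
)

print(row_end(246, query_row))  # 337, the coordinate printed at the end of that row
```

The same check applies to the gap-free Homeobox row: 56 residues starting at position 57 end at position 112.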

Protein Features
Database | Entry ID | E-value | Start | End | InterPro ID | Description
Gene3D | G3DSA:1.10.10.60 | 1.0E-21 | 40 | 112 | IPR009057 | Homeodomain-like
SuperFamily | SSF46689 | 2.76E-19 | 42 | 114 | IPR009057 | Homeodomain-like
PROSITE profile | PS50071 | 16.633 | 54 | 114 | IPR001356 | Homeobox domain
SMART | SM00389 | 8.0E-19 | 55 | 118 | IPR001356 | Homeobox domain
CDD | cd00086 | 1.19E-18 | 57 | 115 | No hit | No description
Pfam | PF00046 | 2.4E-17 | 57 | 112 | IPR001356 | Homeobox domain
PROSITE pattern | PS00027 | 0 | 89 | 112 | IPR017970 | Homeobox, conserved site
PROSITE profile | PS50848 | 45.211 | 237 | 468 | IPR002913 | START domain
SuperFamily | SSF55961 | 1.03E-36 | 237 | 467 | No hit | No description
CDD | cd08875 | 9.60E-130 | 241 | 464 | No hit | No description
SMART | SM00234 | 4.1E-71 | 246 | 465 | IPR002913 | START domain
Pfam | PF01852 | 5.0E-59 | 247 | 465 | IPR002913 | START domain
Gene3D | G3DSA:3.30.530.20 | 2.5E-6 | 318 | 465 | IPR023393 | START-like domain
SuperFamily | SSF55961 | 4.71E-27 | 485 | 716 | No hit | No description
Gene Ontology
GO Term | GO Category | GO Description
GO:0006355 | Biological Process | regulation of transcription, DNA-templated
GO:0009845 | Biological Process | seed germination
GO:0009913 | Biological Process | epidermal cell differentiation
GO:0048497 | Biological Process | maintenance of floral organ identity
GO:0048825 | Biological Process | cotyledon development
GO:0005634 | Cellular Component | nucleus
GO:0008289 | Molecular Function | lipid binding
GO:0043565 | Molecular Function | sequence-specific DNA binding
Sequence
Protein Sequence    Length: 721 aa
MFSPNLFDSP HMFDMTHKTS EGELGKIRDD DYETKSGTET MDVPSGDEQD PNQRPKKKRY  60
HRHTQLQIQE MEAFFKECPH PDDKQRKELS RELGLEPLQV KFWFQNKRTQ MKAQHERHEN  120
AILKAENEKL RAENNRYKEA LSNATCPNCG GPAALGEMSF DEQHLRIENA RLREEIDRIS  180
GIAAKYVGKP LTSFPHISSH LHSRSLDPGA SNFGTQSGFV GEMYGGGDLL RSVSGPTEAD  240
KPMIVELAVA AMEELIRMAQ SGEPLWVPGE KSTDVLNEDE YLRTFPRGIG PKPLGLRSEA  300
SRESAVVIMN HVNLVEILMD VNQWSSAFCG IVSRAMTLEV LSTGVAGNYN GALQVMTAEF  360
QVPSPLVPTR ENYFVRYCKQ HTDGTWAVVD VSLDNLRPSP MSKCRRRPSG CLIQELPNGY  420
SKVIWVEHVE VDDRAVHNIY RPLVNSGLAF GAKRWVATLD RQCERLASSM ASNIPAGDLC  480
VITSPEGRKS MLKLAERMVT SFCTGVGAST AHAWTTLSAT GSDDVRVMTR KSMDDPGRPP  540
GIVLSAATSF WIPVPPKRVF DFLRDENSRS EWDILSNGGL VQEMAHIANG RDPGNCVSLL  600
RVNSANSSQS NMLILQESCS DATGSYVIYA PVDIVAMNVV LSGGDPDYVA LLPSGFAILP  660
DGPGLNGGGI LEIGSGGSLL TVAFQILVDS VPTAKLSLGS VATVNSLIKC TVERIKAAVA  720
*
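
The numbered sequence block can be turned back into a plain string by discarding the position counters and spaces. The Signature Domain coordinates (1-based, inclusive) then map directly onto Python slices. A minimal sketch that uses only the first two lines of the sequence above; `clean_sequence` and `subseq` are hypothetical helper names introduced for illustration:

```python
import re


def clean_sequence(block: str) -> str:
    """Keep only letters: drops spaces, position counters and a '*' stop marker."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def subseq(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both 1-based and inclusive."""
    return seq[start - 1:end]


# First two lines of the protein sequence block (residues 1-120).
block = """
MFSPNLFDSP HMFDMTHKTS EGELGKIRDD DYETKSGTET MDVPSGDEQD PNQRPKKKRY  60
HRHTQLQIQE MEAFFKECPH PDDKQRKELS RELGLEPLQV KFWFQNKRTQ MKAQHERHEN  120
"""

seq = clean_sequence(block)
homeobox = subseq(seq, 57, 112)  # Homeobox domain coordinates from the table

print(len(seq))   # 120
print(homeobox)   # KKRYHRHTQLQIQEMEAFFKECPHPDDKQRKELSRELGLEPLQVKFWFQNKRTQMK
```

The extracted string matches the query line of the Homeobox alignment in the Signature Domain section, which is a quick way to confirm that the coordinates and the sequence refer to the same gene model.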
Functional Description
Source | Description
UniProt | Probable transcription factor that binds to the L1 box DNA sequence 5'-TAAATG[CT]A-3'. Plays a role in maintaining the identity of L1 cells, possibly by interacting with their L1 box or other target-gene promoters. Functionally redundant to ATML1. {ECO:0000269|PubMed:12505995}.
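
The L1 box consensus quoted above, 5'-TAAATG[CT]A-3', is already close to regular-expression syntax, so candidate sites can be located with a simple scan of both strands. A minimal sketch; the promoter string is a made-up example rather than a real cacao or Arabidopsis sequence, and `find_l1_boxes` is a hypothetical helper:

```python
import re

# Lookahead so that overlapping sites (e.g. TAAATGTAAATGTA) are all reported.
L1_BOX = re.compile(r"(?=(TAAATG[CT]A))")
MOTIF_LEN = 8
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    return dna.translate(_COMPLEMENT)[::-1]


def find_l1_boxes(dna: str):
    """Return (start, strand, site) tuples; start is 1-based on the forward strand."""
    dna = dna.upper()
    hits = [(m.start() + 1, "+", m.group(1)) for m in L1_BOX.finditer(dna)]
    rc = reverse_complement(dna)
    for m in L1_BOX.finditer(rc):
        # Map the offset on the reverse complement back to forward coordinates.
        start = len(dna) - (m.start() + MOTIF_LEN) + 1
        hits.append((start, "-", m.group(1)))
    return sorted(hits)


# Invented test sequence with one site on each strand.
promoter = "GGTAAATGTACCTGCATTTAGG"
print(find_l1_boxes(promoter))
# [(3, '+', 'TAAATGTA'), (13, '-', 'TAAATGCA')]
```

A match found this way is only a sequence-level candidate; whether the protein binds a given site is a separate experimental question.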
Regulation -- PlantRegMap
Source | Upstream Regulator | Target Gene
PlantRegMap | Retrieve | -
Annotation -- Nucleotide
Source | Hit ID | E-value | Description
GenBank | JN585951 | 1e-145 | JN585951.1 Gossypium hirsutum HD-1A gene, complete cds.
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_007026002.1 | 0.0 | PREDICTED: homeobox-leucine zipper protein MERISTEM L1
Refseq | XP_007026003.1 | 0.0 | PREDICTED: homeobox-leucine zipper protein MERISTEM L1
Refseq | XP_007026004.1 | 0.0 | PREDICTED: homeobox-leucine zipper protein MERISTEM L1
Swissprot | Q93V99 | 0.0 | PDF2_ARATH; Homeobox-leucine zipper protein PROTODERMAL FACTOR 2
TrEMBL | A0A061GGE7 | 0.0 | A0A061GGE7_THECC; Protodermal factor 2 isoform 1
STRING | EOY28624 | 0.0 | (Theobroma cacao)
Orthologous Group
Lineage | Orthologous Group ID | Taxa Number | Gene Number
MalvidsOGEM49128149
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT4G04890.1 | 0.0 | protodermal factor 2
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]
  3. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]