PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PGSC0003DMP400004701
Common Name LOC102591633
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family HD-ZIP
Protein Properties Length: 773aa    MW: 85945.9 Da    PI: 6.5162
Description HD-ZIP family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
PGSC0003DMP400004701  genome  PGSC    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  65.9   5.4e-21  113    168  1          56
                           TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           r+k +++t +q++e+e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k
  PGSC0003DMP400004701 113 RKKYHRHTVQQIREMEALFKESPHPDEKQRQQLSKQLGLHPRQVKFWFQNRRTQIK 168
                           7999************************************************9877 PP

2    START     214.1  4.8e-67  287    509  3          206
                           HHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.....SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EE CS
                 START   3 aeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv....dsgealrasgvvdmvlallveellddkeqWdetla....ka 79 
                            ++a+++l k+a+++ep+W ks     e++n+de++++f++++v      +ea+r++g+v+m+l++lv++++d++ qW e+++    ka
  PGSC0003DMP400004701 287 VNQAMEQLTKMATSGEPLWIKSFetgrEILNYDEYTKEFPPGDVkskrMCIEASRDTGIVFMELPRLVQTFMDVN-QWREMFPsiisKA 374
                           5789******************99********************999999*************************.************* PP

                           EEEEEECTT........EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECT CS
                 START  80 etlevissg........galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksn 159
                           +t++vi++g        ga+qlm+ae q+l+p+v  R+++fvRy++q++a +w ivdvSvd  +    ++s+ ++++lpSg+++++ sn
  PGSC0003DMP400004701 375 ATVDVICNGteganswdGAVQLMFAEVQMLTPVVGtREVYFVRYCKQMSAAQWGIVDVSVDKVEGSI-DASLLKCRKLPSGCILQEQSN 462
                           *****************************************************************87.9******************** PP

                           CEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                 START 160 ghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                            h+kvtwveh +++++++++l+r++v+sg+a+ga++w+atlq+qce+
  PGSC0003DMP400004701 463 AHCKVTWVEHLECQKSIVDSLYRVTVNSGQAFGARRWMATLQQQCER 509
                           *********************************************97 PP

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   4.6E-22    97     164  IPR009057    Homeodomain-like
SuperFamily      SSF46689           9.61E-21   98     171  IPR009057    Homeodomain-like
PROSITE profile  PS50071            17.799     110    170  IPR001356    Homeobox domain
SMART            SM00389            1.9E-18    112    174  IPR001356    Homeobox domain
Pfam             PF00046            2.8E-18    113    168  IPR001356    Homeobox domain
CDD              cd00086            2.17E-16   117    168  No hit       No description
PROSITE pattern  PS00027            0          145    168  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848            35.066     276    512  IPR002913    START domain
SuperFamily      SSF55961           1.51E-30   278    509  No hit       No description
CDD              cd08875            6.93E-104  280    508  No hit       No description
SMART            SM00234            2.5E-62    285    509  IPR002913    START domain
Pfam             PF01852            3.7E-54    287    509  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  3.6E-4     288    505  IPR023393    START-like domain
SuperFamily      SSF55961           2.06E-12   552    760  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0009957  Biological Process  epidermal cell fate specification
GO:0010062  Biological Process  negative regulation of trichoblast fate specification
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 773 aa
MVVVDMSNNH PPSHETKDFF PSPALSLSLA GIFRDGGGAG SSAGNMETTE EVDEGSAAAS  60
RGGRPREETS TVEISSENSE QMRSRGSDDD LEHDDTCNED EEDPNNNSKK KKRKKYHRHT  120
VQQIREMEAL FKESPHPDEK QRQQLSKQLG LHPRQVKFWF QNRRTQIKAI QERHENSLLK  180
AEIEKLREEN KGLRGTSKNP SCPNCGFASS SNNAPTLPAE EQQLRIENAR LRAEVEKLRA  240
ALGKYPLGTS PNSSSSCSGG NDEENKSALD FYTGIFGLEK PRIMHIVNQA MEQLTKMATS  300
GEPLWIKSFE TGREILNYDE YTKEFPPGDV KSKRMCIEAS RDTGIVFMEL PRLVQTFMDV  360
NQWREMFPSI ISKAATVDVI CNGTEGANSW DGAVQLMFAE VQMLTPVVGT REVYFVRYCK  420
QMSAAQWGIV DVSVDKVEGS IDASLLKCRK LPSGCILQEQ SNAHCKVTWV EHLECQKSIV  480
DSLYRVTVNS GQAFGARRWM ATLQQQCERL LFFMATNIPT KDTTGVATLA GRKSILTLAQ  540
RMTWGFYRVL GASSYNTWNK VPSKTGQEDI RMTSRRNLTD PGEPQGLILC AVSSIWLPVS  600
RNVLFDFLKD ENHRHEWDVM SNGGPVQSVA NLAKGQDKGN AVSIQAVKLR ENNMWILQDT  660
CTNAYESAVV YAPVDIAGMQ SVITGCDSSN IAVLPSGFSI LPDGLESRPF VITSRPEDRS  720
SEGGSLLTVA FQILTSNSPT TKLSKESVES INNLLSCTLH KIKTRFQCDN GY*
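The grouped layout above (blocks of ten residues, a running position count at the end of each line, and a trailing `*`) has to be flattened before the sequence can be used in other tools. A minimal Python sketch, assuming the block is pasted as plain text; the function name `clean_sequence` is hypothetical, and only the first two lines of the record are used as sample input:

```python
import re

def clean_sequence(block: str) -> str:
    """Drop spaces, position counters, and the trailing '*'; keep residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()

# First two lines of the record above (residues 1-120), copied verbatim.
sample = """\
MVVVDMSNNH PPSHETKDFF PSPALSLSLA GIFRDGGGAG SSAGNMETTE EVDEGSAAAS  60
RGGRPREETS TVEISSENSE QMRSRGSDDD LEHDDTCNED EEDPNNNSKK KKRKKYHRHT  120
"""

prefix = clean_sequence(sample)
print(len(prefix))  # 120
```

Whether the Length value in the header counts the trailing `*` is not stated in the record, so a difference of one from the cleaned length is possible when the full block is processed.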
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    108    113  KKKKRK
2    108    114  KKKKRKK
3    110    114  KKRKK
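The NLS coordinates can be checked against the protein sequence by searching for each listed motif. A minimal Python sketch, assuming 1-based inclusive positions for its output; `find_motif` is a hypothetical helper, and `prefix` holds only residues 1-120 of the protein sequence in this record:

```python
def find_motif(seq: str, motif: str) -> list:
    """Return 1-based inclusive (start, end) for every occurrence, overlaps included."""
    hits = []
    pos = seq.find(motif)
    while pos != -1:
        hits.append((pos + 1, pos + len(motif)))
        pos = seq.find(motif, pos + 1)
    return hits

# Residues 1-120 of the protein sequence, with spaces and counters removed.
prefix = (
    "MVVVDMSNNHPPSHETKDFFPSPALSLSLAGIFRDGGGAGSSAGNMETTEEVDEGSAAAS"
    "RGGRPREETSTVEISSENSEQMRSRGSDDDLEHDDTCNEDEEDPNNNSKKKKRKKYHRHT"
)

for motif in ("KKKKRK", "KKKKRKK", "KKRKK"):
    print(motif, find_motif(prefix, motif))
```

On these 120 residues the sketch reports 109-114, 109-115, and 111-115, each one higher than the Start and End values listed for the NLS entries. That pattern would be consistent with the NLS entries using 0-based positions, but this is an inference from this record, not something the page states.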
Functional Description
Source   Description
UniProt  Probable transcription factor required for correct morphological development and maturation of trichomes as well as for normal development of seed coat mucilage. Regulates the frequency of trichome initiation and determines trichome spacing. {ECO:0000269|PubMed:11844112}.
Cis-element
Source       Link
PlantRegMap  PGSC0003DMP400004701
Regulation -- Description
Source   Description
UniProt  INDUCTION: Down-regulated by GEM. {ECO:0000269|PubMed:17450124}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975515  1e-176   HG975515.1 Solanum lycopersicum chromosome ch03, complete genome.
Annotation -- Protein
Source     Hit ID                E-value  Description
Refseq     XP_006343080.1        0.0      PREDICTED: homeobox-leucine zipper protein GLABRA 2
Swissprot  P46607                0.0      HGL2_ARATH; Homeobox-leucine zipper protein GLABRA 2
TrEMBL     M0ZRY5                0.0      M0ZRY5_SOLTU; Uncharacterized protein
STRING     PGSC0003DMT400006770  0.0      (Solanum tuberosum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA6228              23           28
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G79840.1  0.0      HD-ZIP family protein
Publications
  1. Xu X, et al.
    Genome sequence and analysis of the tuber crop potato.
    Nature, 2011. 475(7355): p. 189-95
    [PMID:21743474]
  2. Wu R, Citovsky V
    Adaptor proteins GIR1 and GIR2. I. Interaction with the repressor GLABRA2 and regulation of root hair development.
    Biochem. Biophys. Res. Commun., 2017. 488(3): p. 547-553
    [PMID:28526410]