PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID HL.SW.v1.0.G021599.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Cannabaceae; Humulus
Family HD-ZIP
Protein Properties Length: 1073aa    MW: 119354 Da    PI: 5.3262
Description HD-ZIP family protein
Gene Model
Gene Model ID         Type    Source   Coding Sequence
HL.SW.v1.0.G021599.1  genome  HOPBASE  View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  66.6   3.3e-21  100    155  1          56
                           TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           r+k +++t+eq++e+e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k
  HL.SW.v1.0.G021599.1 100 RKKYHRHTAEQIREMEALFKESPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK 155
                           7999************************************************9877 PP

2    Homeobox  66.6   3.3e-21  418    473  1          56
                           TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           r+k +++t+eq++e+e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k
  HL.SW.v1.0.G021599.1 418 RKKYHRHTAEQIREMEALFKESPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK 473
                           7999************************************************9877 PP

3    START     38.6   2.7e-13  270    347  3          69
                           HHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS........SCEEEEEEEECCSCHHHHHHHHHCCCG CS
                 START   3 aeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.......dsgealrasgvvdmvlallveellddk 69 
                            ++ ++el+k+a+a+ep+W +s+    e++n+de+l++f+ +++       +s+ea+r++gvv+ +l++lv++++d++
  HL.SW.v1.0.G021599.1 270 VNQSMEELIKMATAGEPLWIRSVetgrEILNYDEYLKEFSVENSamnngpkRSVEASREVGVVFVDLPRLVQSFMDVA 347
                           57889**********************************988889******************************985 PP

4    START     232.7  1e-72    588    811  3          206
                           HHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS........SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S... CS
                 START   3 aeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.......dsgealrasgvvdmvlallveellddkeqWdetla... 77 
                            ++ ++el+k+a+a+ep+W +s+    e++n+de+l++f+ +++       +s+ea+r++gvv+ +l++lv++++d++ qW+e+++   
  HL.SW.v1.0.G021599.1 588 VNQSMEELIKMATAGEPLWIRSVetgrEILNYDEYLKEFSVENSamnngpkRSVEASREVGVVFVDLPRLVQSFMDVN-QWKEMFPcmi 675
                           57889**********************************988889*********************************.********** PP

                           .EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEEC CS
                 START  78 .kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepks 158
                            ka+t++vis+g      ga+qlm+aelq+l+p+vp R+++fvRy++ql+a++w++vdvS+d+ +++  ++s+++++++pSg++ie+ks
  HL.SW.v1.0.G021599.1 676 sKAATVDVISNGeadnknGAVQLMFAELQMLTPMVPtREVYFVRYCKQLSAEQWAVVDVSIDNVEENI-DASLIKCRKRPSGCIIEDKS 763
                           ******************************************************************98.9******************* PP

                           TCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                 START 159 nghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                           ngh+kvtwveh +++++++h ++r +v+sgla+ga++w atlq qce+
  HL.SW.v1.0.G021599.1 764 NGHCKVTWVEHLECQKSTVHTMYRTIVNSGLAFGARHWIATLQLQCER 811
                           **********************************************97 PP
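
The alignment blocks above follow the usual HMMER-style layout: a CS line (consensus structure annotation), the profile consensus, a match line, the query sequence, and a PP line (posterior probabilities). As a rough reading aid, the sketch below tallies a match line, assuming the common convention that a letter marks a residue identical to the consensus, `+` marks a conservative substitution, and a blank marks a mismatch. The function name `summarize_match_line` is hypothetical and is not part of PlantTFDB or HMMER.

```python
def summarize_match_line(match_line: str) -> dict:
    """Count identical, conservative, and mismatched columns in a match line.

    Assumed convention: letter = identical to the consensus,
    '+' = conservative substitution, anything else (blank) = mismatch.
    """
    counts = {"identical": 0, "conservative": 0, "mismatch": 0}
    for ch in match_line:
        if ch.isalpha():
            counts["identical"] += 1
        elif ch == "+":
            counts["conservative"] += 1
        else:
            counts["mismatch"] += 1
    return counts


# Match line of Homeobox domain 1 (residues 100-155), copied from the record.
homeobox_match = "r+k +++t+eq++e+e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k"
summary = summarize_match_line(homeobox_match)
print(summary)
```

The three counts always sum to the number of alignment columns, so they can be turned into fractions when comparing domains of different lengths.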

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End   InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   2.2E-23    85     151   IPR009057    Homeodomain-like
SuperFamily      SSF46689           1.63E-20   90     158   IPR009057    Homeodomain-like
PROSITE profile  PS50071            18.22      97     157   IPR001356    Homeobox domain
SMART            SM00389            8.3E-19    99     161   IPR001356    Homeobox domain
Pfam             PF00046            1.6E-18    100    155   IPR001356    Homeobox domain
CDD              cd00086            4.10E-16   104    155   No hit       No description
PROSITE pattern  PS00027            0          132    155   IPR017970    Homeobox, conserved site
PROSITE profile  PS50848            10.465     259    301   IPR002913    START domain
Pfam             PF01852            1.4E-6     269    347   IPR002913    START domain
Gene3D           G3DSA:1.10.10.60   2.2E-23    403    469   IPR009057    Homeodomain-like
SuperFamily      SSF46689           1.63E-20   408    476   IPR009057    Homeodomain-like
PROSITE profile  PS50071            18.22      415    475   IPR001356    Homeobox domain
SMART            SM00389            8.3E-19    417    479   IPR001356    Homeobox domain
Pfam             PF00046            1.6E-18    418    473   IPR001356    Homeobox domain
CDD              cd00086            4.10E-16   422    473   No hit       No description
PROSITE pattern  PS00027            0          450    473   IPR017970    Homeobox, conserved site
PROSITE profile  PS50848            41.315     577    814   IPR002913    START domain
SuperFamily      SSF55961           1.1E-34    579    811   No hit       No description
CDD              cd08875            3.39E-114  581    810   No hit       No description
SMART            SM00234            4.1E-75    586    811   IPR002913    START domain
Pfam             PF01852            3.5E-58    587    811   IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  2.9E-6     627    806   IPR023393    START-like domain
SuperFamily      SSF55961           4.26E-14   841    1061  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0009957  Biological Process  epidermal cell fate specification
GO:0010062  Biological Process  negative regulation of trichoblast fate specification
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 1073 aa
MGVDMSNNTP PNSRTKDYFA SPALSLSLAG IFRDAGAAAA AAANIEVEEG DEGSGGGRKD  60
DTVEISSETS GPARSRSDDE FEAEGDEDDG DGDKSKKKKR KKYHRHTAEQ IREMEALFKE  120
SPHPDEKQRQ QLSKQLGLAP RQVKFWFQNR RTQIKAIQER HENSLLKSEI DKLRDENKSL  180
REQINKSCCP NCGSATVARD PTNSTDEQQL RIENAKLKAE VEKLRATIRK YPMGTSSPSC  240
SAGNEQETRS SLEFYTGIFG LEKSRMMEIV NQSMEELIKM ATAGEPLWIR SVETGREILN  300
YDEYLKEFSV ENSAMNNGPK RSVEASREVG VVFVDLPRLV QSFMDVAGIF RDAGAAAAAA  360
ANIEVEEGDE GSGGGRKDDT VEISSETSGP ARSRSDDEFE AEGDEDDGDG DKSKKKKRKK  420
YHRHTAEQIR EMEALFKESP HPDEKQRQQL SKQLGLAPRQ VKFWFQNRRT QIKAIQERHE  480
NSLLKSEIDK LRDENKSLRE QINKSCCPNC GSATVARDPT NSTDEQQLRI ENAKLKAEVE  540
KLRATIRKYP MGTSSPSCSA GNEQETRSSL EFYTGIFGLE KSRMMEIVNQ SMEELIKMAT  600
AGEPLWIRSV ETGREILNYD EYLKEFSVEN SAMNNGPKRS VEASREVGVV FVDLPRLVQS  660
FMDVNQWKEM FPCMISKAAT VDVISNGEAD NKNGAVQLMF AELQMLTPMV PTREVYFVRY  720
CKQLSAEQWA VVDVSIDNVE ENIDASLIKC RKRPSGCIIE DKSNGHCKVT WVEHLECQKS  780
TVHTMYRTIV NSGLAFGARH WIATLQLQCE RLVFFMATNV PMKDSTGVAT LAGRKSILKL  840
AQRMTWSFCR AIAASSYNTW TKVSSKTGDD IRITSRKNLN DPGEPLGLIL CAVSSVWLPV  900
SANVLFDFLR DENHRNEWDI ISSGGSVESM ANLSKGQDRG NAVSIQTVNS KENSMWVLQD  960
CCTNPFESMV VYAPVNISGM QSVMTGCDSS NLAVLPSGFS ILPDGIESRP LVITSRPEEK  1020
SSEGGSLLTV AFQILTNSSP TAKLTMESVD SVNTLISCTL KNIKTSLQCE EG*
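
The sequence block above is laid out in groups of ten residues with a running position counter at the end of each line and a terminal `*`. A minimal sketch of how such a block could be flattened and sliced by the domain coordinates listed earlier, assuming those coordinates are 1-based and inclusive (which matches the Homeobox alignment at 100-155). The helper names `parse_sequence_block` and `extract_region` are hypothetical, and only the first three lines of the record are used here.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Flatten a numbered, space-grouped sequence block into one string.

    Everything that is not a letter is dropped: spaces, the trailing
    position counters, and the terminal '*' stop marker.
    """
    return "".join(re.sub(r"[^A-Za-z]", "", line) for line in block.splitlines())


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return sequence[start - 1:end]


# First three lines (residues 1-180) of the protein sequence in this record.
excerpt = """\
MGVDMSNNTP PNSRTKDYFA SPALSLSLAG IFRDAGAAAA AAANIEVEEG DEGSGGGRKD  60
DTVEISSETS GPARSRSDDE FEAEGDEDDG DGDKSKKKKR KKYHRHTAEQ IREMEALFKE  120
SPHPDEKQRQ QLSKQLGLAP RQVKFWFQNR RTQIKAIQER HENSLLKSEI DKLRDENKSL  180
"""

sequence = parse_sequence_block(excerpt)
homeobox_1 = extract_region(sequence, 100, 155)
print(homeobox_1)
```

Comparing the printed region with the query line of the first Homeobox alignment is a quick check that the coordinate convention has been read correctly.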
Nucleic Localization Signal
NLS
No.  Start  End  Sequence
1    413    418  KKKKRK
2    413    419  KKKKRKK
3    415    419  KKRKK
Functional Description
Source   Description
UniProt  Probable transcription factor required for correct morphological development and maturation of trichomes as well as for normal development of seed coat mucilage. Regulates the frequency of trichome initiation and determines trichome spacing. {ECO:0000269|PubMed:11844112}.
Regulation -- Description
Source   Description
UniProt  INDUCTION: Down-regulated by GEM. {ECO:0000269|PubMed:17450124}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_010098082.1  0.0      homeobox-leucine zipper protein GLABRA 2
Swissprot  P46607          0.0      HGL2_ARATH; Homeobox-leucine zipper protein GLABRA 2
TrEMBL     A0A2P5ES90      0.0      A0A2P5ES90_TREOI; Octamer-binding transcription factor
STRING     XP_010098082.1  0.0      (Morus notabilis)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids   OGEF4714              33           52
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G79840.1  0.0      HD-ZIP family protein
Publications
  1. Wu R, Citovsky V
    Adaptor proteins GIR1 and GIR2. I. Interaction with the repressor GLABRA2 and regulation of root hair development.
    Biochem. Biophys. Res. Commun., 2017. 488(3): p. 547-553
    [PMID:28526410]