PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.009G135100.2
Common Name B456_009G135100, LOC105767919
Organism Gossypium raimondii
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 759 aa    MW: 83885.9 Da    pI: 6.5142
Description HD-ZIP family protein
Gene Model
Gene Model ID       Type    Source  Coding Sequence
Gorai.009G135100.2  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  67.2   2.2e-21  103    158  1          56
                         TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
            Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                         r+k +++t++q++e+e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k
  Gorai.009G135100.2 103 RKKYHRHTADQIREMEALFKESPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK 158
                         7999************************************************9877 PP

2    START     232.5  1.1e-72  273    496  2          206
                         HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....E CS
               START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....k 78 
                         + ++a++el+k+a+a+ep+Wv+s+    e++n+de++++f+ + +      +s+ea+r++gvv+ +l++lv++++d + qW+e+++    k
  Gorai.009G135100.2 273 IVNQAMEELQKMATAGEPLWVRSVetgrEILNYDEYVKEFSVESSsngrpkRSIEASRETGVVFLDLPRLVQSFMDAN-QWKEMFPciisK 362
                         56789***********************************77666899******************************.************ PP

                         EEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEE CS
               START  79 aetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghs 162
                         a+t++vi+ g      ga+qlm+aelq+l+plvp R+++fvRy++ql+a++w+ivdvS+d  +++  ++s+v+++++pSg++i++k ngh+
  Gorai.009G135100.2 363 AATVDVICHGeapnknGAVQLMFAELQMLTPLVPtREVYFVRYCKQLSAEQWAIVDVSIDKVEENI-DASLVKCRKRPSGCIIQDKTNGHC 452
                         ****************************************************************98.9*********************** PP

                         EEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
               START 163 kvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                         kv+wveh +++++++h l+r +v+sgla+ga++w+atlq+qce+
  Gorai.009G135100.2 453 KVIWVEHLECQKNTVHTLFRTIVRSGLAFGARHWMATLQHQCER 496
                         ******************************************97 PP

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   8.4E-23    89     154  IPR009057    Homeodomain-like
SuperFamily      SSF46689           2.26E-20   91     161  IPR009057    Homeodomain-like
PROSITE profile  PS50071            18.091     100    160  IPR001356    Homeobox domain
SMART            SM00389            8.6E-19    102    164  IPR001356    Homeobox domain
Pfam             PF00046            1.0E-18    103    158  IPR001356    Homeobox domain
CDD              cd00086            9.54E-17   107    158  No hit       No description
PROSITE pattern  PS00027            0          135    158  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848            39.648     263    499  IPR002913    START domain
SuperFamily      SSF55961           7.42E-34   265    496  No hit       No description
CDD              cd08875            1.62E-113  267    495  No hit       No description
Gene3D           G3DSA:3.30.530.20  9.7E-6     270    495  IPR023393    START-like domain
SMART            SM00234            8.5E-74    272    496  IPR002913    START domain
Pfam             PF01852            1.9E-58    273    496  IPR002913    START domain
SuperFamily      SSF55961           1.65E-14   526    745  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0009957  Biological Process  epidermal cell fate specification
GO:0010062  Biological Process  negative regulation of trichoblast fate specification
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 759 aa
MGVVDMTNPP TKDFFASPAL SLSLAGIFRD AGATAAAPTA SASMEVEEGD EGSGGGGGSG  60
SKKDDTVEIS SENSGPARSR SEDDLLDHDD DEDDADKSKK KKRKKYHRHT ADQIREMEAL  120
FKESPHPDEK QRQQLSKQLG LAPRQVKFWF QNRRTQIKAI QERHENSLLK QELDKLRDEN  180
KAMRETINKA CCLNCGMATT AKDGSITAEE QQLRIENAKL KAEVEKLRTV IGKYPPGAST  240
TGSCSSENDQ ENRSSLDFYT GIFGLEKSRI MEIVNQAMEE LQKMATAGEP LWVRSVETGR  300
EILNYDEYVK EFSVESSSNG RPKRSIEASR ETGVVFLDLP RLVQSFMDAN QWKEMFPCII  360
SKAATVDVIC HGEAPNKNGA VQLMFAELQM LTPLVPTREV YFVRYCKQLS AEQWAIVDVS  420
IDKVEENIDA SLVKCRKRPS GCIIQDKTNG HCKVIWVEHL ECQKNTVHTL FRTIVRSGLA  480
FGARHWMATL QHQCERLVFF MATNVPTKDS TGVATLAGRK SILKLAQRMT WSFCHSIGAS  540
SYHTWNKVST KTGEDIRVSS RKNLNDPGEP HGVIVCAVSS VWLPVSPTLL FDFLRDESRR  600
SEWDIMSNGG PVQSIANLAK GKDRGNAVTI QAMKSKENSM WVLQDSCTNA FESMVVFAHV  660
DVTGIQSVIT GCDSSNMAIL PSGFSILPDG LESRPLVISS RHEKSNDTEG GSLLTVAFQI  720
LTNSSPTAKL TMESVESVNT IVSCTLRNIK TSLQCEDG*
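The grouped listing above (blocks of 10 residues with a running position counter at the end of each line) has to be flattened before domain coordinates can be applied to it. The following is a minimal Python sketch of that step, not part of the PlantTFDB record itself. The helper names `parse_grouped_sequence` and `region` are hypothetical, only the first two lines of the sequence are embedded to keep the example short, and the sketch assumes that the Start/End values in the Signature Domain table are 1-based and inclusive, which is consistent with the alignment block where residue 103 opens the Homeobox match.

```python
import re


def parse_grouped_sequence(listing: str) -> str:
    """Return the bare residue string from a grouped, numbered listing."""
    # Keep letters only: this drops spaces, line breaks, the running
    # position counters and a trailing '*' stop symbol if present.
    return re.sub(r"[^A-Za-z]", "", listing).upper()


def region(seq: str, start: int, end: int) -> str:
    """Slice seq with 1-based inclusive coordinates (assumed convention)."""
    return seq[start - 1:end]


# First two lines of the protein sequence, copied from the record.
listing = """
MGVVDMTNPP TKDFFASPAL SLSLAGIFRD AGATAAAPTA SASMEVEEGD EGSGGGGGSG  60
SKKDDTVEIS SENSGPARSR SEDDLLDHDD DEDDADKSKK KKRKKYHRHT ADQIREMEAL  120
"""

seq = parse_grouped_sequence(listing)
print(len(seq))               # 120
print(region(seq, 103, 110))  # RKKYHRHT, the first residues of the Homeobox match
```

With the full 13-line listing pasted into `listing`, the same `region` call with the table values (for example 103 and 158 for Homeobox, or 273 and 496 for START) should return the corresponding domain subsequence, under the same coordinate assumption.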
Nucleic Localization Signal
NLS
No.  Start  End  Sequence
1    98     103  KKKKRK
2    98     104  KKKKRKK
3    100    104  KKRKK
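The NLS coordinates can be checked against the protein sequence with a plain substring search. The sketch below is illustrative only: `find_motif` is a hypothetical helper, and only the first 120 residues of the sequence are embedded. In this record the three motifs are found at 0-based offsets that coincide with the listed Start and End values; that is an observation on this entry, not a documented PlantTFDB convention, and it differs from the 1-based positions used in the domain alignment.

```python
import re

# First 120 residues of the protein sequence, copied from the record.
listing = """
MGVVDMTNPP TKDFFASPAL SLSLAGIFRD AGATAAAPTA SASMEVEEGD EGSGGGGGSG  60
SKKDDTVEIS SENSGPARSR SEDDLLDHDD DEDDADKSKK KKRKKYHRHT ADQIREMEAL  120
"""
# Strip spaces, line breaks and position counters, keeping residues only.
seq = re.sub(r"[^A-Za-z]", "", listing)


def find_motif(seq: str, motif: str):
    """Return (start, end) of the first match as 0-based inclusive offsets."""
    start = seq.find(motif)
    if start < 0:
        return None
    return start, start + len(motif) - 1


for motif in ("KKKKRK", "KKKKRKK", "KKRKK"):
    print(motif, find_motif(seq, motif))
# KKKKRK (98, 103)
# KKKKRKK (98, 104)
# KKRKK (100, 104)
```

Adding 1 to each offset gives the equivalent 1-based positions (99 to 104 for the first motif), which place the signal just upstream of the Homeobox match that starts at residue 103.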
Expression -- UniGene
UniGene ID  E-value  Expressed in
Gra.930     0.0      seedling
Expression -- Microarray
Source  ID        E-value
GEO     48752703  0.0
Expression -- Description
Source   Description
UniProt  TISSUE SPECIFICITY: Expressed in developing trichomes.
Functional Description
Source   Description
UniProt  Probable transcription factor required for correct morphological development and maturation of trichomes as well as for normal development of seed coat mucilage. Regulates the frequency of trichome initiation and determines trichome spacing. {ECO:0000269|PubMed:11844112}.
Regulation -- Description
Source   Description
UniProt  INDUCTION: Down-regulated by GEM. {ECO:0000269|PubMed:17450124}.
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AF530913  0.0      AF530913.1 Gossypium hirsutum homeodomain protein GhHOX1 mRNA, complete cds.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_012442963.1      0.0      PREDICTED: homeobox-leucine zipper protein GLABRA 2
Refseq     XP_012442964.1      0.0      PREDICTED: homeobox-leucine zipper protein GLABRA 2
Swissprot  P46607              0.0      HGL2_ARATH; Homeobox-leucine zipper protein GLABRA 2
TrEMBL     A0A0D2TJ79          0.0      A0A0D2TJ79_GOSRA; Uncharacterized protein
STRING     Gorai.009G135100.1  0.0      (Gossypium raimondii)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G79840.1  0.0      HD-ZIP family protein
Publications
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]
  2. Wu R, Citovsky V
    Adaptor proteins GIR1 and GIR2. I. Interaction with the repressor GLABRA2 and regulation of root hair development.
    Biochem. Biophys. Res. Commun., 2017. 488(3): p. 547-553
    [PMID:28526410]