PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.001G171000.6
Common Name B456_001G171000, LOC105797313
Organism Gossypium raimondii
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 488aa    MW: 53395.6 Da    PI: 6.2583
Description HD-ZIP family protein
Gene Model
Gene Model ID | Type | Source | Coding Sequence
Gorai.001G171000.6 | genome | JGI | View CDS
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | Homeobox | 67.1 | 2.3e-21 | 107 | 162 | 1 | 56
                         TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
            Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                         +++ +++t++q++eLe+lF+++++p++++r eL+k+l L++rqVk+WFqNrR+++k
  Gorai.001G171000.6 107 KKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMK 162
                         688999***********************************************999 PP

2 | START | 144.7 | 8.9e-46 | 302 | 486 | 1 | 165
                         HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....E CS
               START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....k 78 
                         ela++a++elvk+a+ +ep+W +s     e++n de+ + f++  +     + +ea+r +g+v+ ++  lve+l+d++ +W e+++    +
  Gorai.001G171000.6 302 ELALAAMDELVKMAQTDEPLWIRSLevgrEILNHDEYSRMFTPCIGikpagFLTEASRQTGLVIINSLALVETLMDSN-RWAEMFPcmiaR 391
                         5899*************************************9998899******************************.************ PP

                         EEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--..-TTSEE-EESSEEEEEEEECTCE CS
               START  79 aetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe.sssvvRaellpSgiliepksngh 161
                          +t++vissg      ga+qlm aelq+lsplvp R++ f+R+++q+ +g+w++vdvS+d  ++ +     +v++++lpSg+++++++ng+
  Gorai.001G171000.6 392 TSTTDVISSGmggtrnGAIQLMHAELQLLSPLVPvREVNFLRFCKQHAEGVWAVVDVSIDTLRETSGaPTTYVKCRRLPSGCVVQDMPNGY 482
                         ****************************************************************9998899******************** PP

                         EEEE CS
               START 162 skvt 165
                         skvt
  Gorai.001G171000.6 483 SKVT 486
                         ***8 PP

Protein Features
3D Structure
Database | Entry ID | E-value | Start | End | InterPro ID | Description
SuperFamily | SSF46689 | 4.11E-21 | 91 | 164 | IPR009057 | Homeodomain-like
Gene3D | G3DSA:1.10.10.60 | 1.0E-21 | 96 | 164 | IPR009057 | Homeodomain-like
PROSITE profile | PS50071 | 17.41 | 104 | 164 | IPR001356 | Homeobox domain
SMART | SM00389 | 9.9E-18 | 105 | 168 | IPR001356 | Homeobox domain
Pfam | PF00046 | 1.0E-18 | 107 | 162 | IPR001356 | Homeobox domain
CDD | cd00086 | 2.51E-18 | 107 | 164 | No hit | No description
PROSITE pattern | PS00027 | 0 | 139 | 162 | IPR017970 | Homeobox, conserved site
PROSITE profile | PS50848 | 34.772 | 293 | 487 | IPR002913 | START domain
SuperFamily | SSF55961 | 2.88E-25 | 295 | 487 | No hit | No description
CDD | cd08875 | 2.34E-99 | 297 | 486 | No hit | No description
Pfam | PF01852 | 5.9E-38 | 302 | 486 | IPR002913 | START domain
SMART | SM00234 | 7.0E-15 | 302 | 487 | IPR002913 | START domain
Gene Ontology
GO Term | GO Category | GO Description
GO:0006355 | Biological Process | regulation of transcription, DNA-templated
GO:0005634 | Cellular Component | nucleus
GO:0008289 | Molecular Function | lipid binding
GO:0043565 | Molecular Function | sequence-specific DNA binding
Sequence
Protein Sequence    Length: 488 aa
MNFGGSLNNS SGGGLGGATI VADIPFSNNM AAAGAMAQNI YNSPGLSLAL QQPSIGNQGD  60
GVRMGENFEA SIGRRSREEE HESRSGSDNI DGVSGDDQDA ANNRPRKKRY HRHTPQQIQE  120
LEALFKECPH PDEKQRLELS KRLCLETRQV KFWFQNRRTQ MKTQLERHEN SLLRQENDKL  180
RAENMSIREA MRNPICTNCG GPAIIGDLSL EEQHLRIENA RLKDELDRVC ALASKFLGRP  240
LSSLATSIAS PLPNSNLELG VGSNGFGGLS TTLPLGPDFG GGVSNSLPVV PPNGVERSMF  300
LELALAAMDE LVKMAQTDEP LWIRSLEVGR EILNHDEYSR MFTPCIGIKP AGFLTEASRQ  360
TGLVIINSLA LVETLMDSNR WAEMFPCMIA RTSTTDVISS GMGGTRNGAI QLMHAELQLL  420
SPLVPVREVN FLRFCKQHAE GVWAVVDVSI DTLRETSGAP TTYVKCRRLP SGCVVQDMPN  480
GYSKVTF*
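
The Start and End columns of the Signature Domain table can be applied directly to the protein sequence above. The following is a minimal sketch, assuming those coordinates are 1-based and inclusive (which matches the residue numbers shown in the alignments); the helper names `clean_sequence` and `domain_slice` are illustrative and not part of PlantTFDB.

```python
import re

# Protein sequence as printed in this record. The spaces, the position
# counters at line ends and the trailing "*" are page layout, not residues.
RAW_SEQUENCE = """
MNFGGSLNNS SGGGLGGATI VADIPFSNNM AAAGAMAQNI YNSPGLSLAL QQPSIGNQGD  60
GVRMGENFEA SIGRRSREEE HESRSGSDNI DGVSGDDQDA ANNRPRKKRY HRHTPQQIQE  120
LEALFKECPH PDEKQRLELS KRLCLETRQV KFWFQNRRTQ MKTQLERHEN SLLRQENDKL  180
RAENMSIREA MRNPICTNCG GPAIIGDLSL EEQHLRIENA RLKDELDRVC ALASKFLGRP  240
LSSLATSIAS PLPNSNLELG VGSNGFGGLS TTLPLGPDFG GGVSNSLPVV PPNGVERSMF  300
LELALAAMDE LVKMAQTDEP LWIRSLEVGR EILNHDEYSR MFTPCIGIKP AGFLTEASRQ  360
TGLVIINSLA LVETLMDSNR WAEMFPCMIA RTSTTDVISS GMGGTRNGAI QLMHAELQLL  420
SPLVPVREVN FLRFCKQHAE GVWAVVDVSI DTLRETSGAP TTYVKCRRLP SGCVVQDMPN  480
GYSKVTF*
"""


def clean_sequence(raw):
    """Keep only amino-acid letters; drop digits, whitespace and '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def domain_slice(seq, start, end):
    """Return residues start..end, treating both as 1-based and inclusive."""
    return seq[start - 1:end]


SEQ = clean_sequence(RAW_SEQUENCE)

# (name, start, end) copied from the Signature Domain table of this record.
DOMAINS = [("Homeobox", 107, 162), ("START", 302, 486)]

for name, start, end in DOMAINS:
    fragment = domain_slice(SEQ, start, end)
    print(f"{name} {start}-{end} ({len(fragment)} aa): {fragment}")
```

The Homeobox fragment obtained this way can be compared against the query line of the first alignment (residues 107 to 162) as a sanity check on the coordinate convention.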
Expression -- Description
Source | Description
Uniprot | TISSUE SPECIFICITY: Expressed in roots, stems, leaves and floral buds. {ECO:0000269|PubMed:10402424}.
Functional Description
Source | Description
UniProt | Probable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Regulation -- PlantRegMap
Source | Upstream Regulator | Target Gene
PlantRegMap | Retrieve | -
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_012482732.1 | 0.0 | PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
Swissprot | Q0WV12 | 0.0 | ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBL | A0A0D2PSP0 | 0.0 | A0A0D2PSP0_GOSRA; Uncharacterized protein
STRING | Gorai.001G171000.1 | 0.0 | (Gossypium raimondii)
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT4G00730.2 | 0.0 | HD-ZIP family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]