PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.008G117200.2
Common Name B456_008G117200, LOC105763830
Organism Gossypium raimondii
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 805 aa    MW: 87871.8 Da    pI: 6.0992
Description HD-ZIP family protein
Gene Model
Gene Model ID         Type      Source    Coding Sequence
Gorai.008G117200.2    genome    JGI       View CDS
Signature Domain
No.   Domain     Score   E-value   Start   End   HMM Start   HMM End
1     Homeobox   66.2    4.5e-21   123     178   1           56
                         TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
            Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                         +++ +++t++q++eLe+lF+++++p++++r eL+k+l L++rqVk+WFqNrR+++k
  Gorai.008G117200.2 123 KKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMK 178
                         688999***********************************************999 PP

2     START      204.3   5e-64     315     536   1           206
                         HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....E CS
               START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....k 78 
                         ela++a++elvk+a+ +ep+W k      e++n de+l++f++  +     + +ea+r++gvv+ ++  lve+l+d++ +W e++     +
  Gorai.008G117200.2 315 ELALAAMDELVKMAQTDEPLWIKNIeggrEMLNHDEYLRTFTPCIGlkpngFVTEASRETGVVIINSLALVETLMDSN-RWAEMFHcmiaR 404
                         5899**************************************99999***9***************************.************ PP

                         EEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEE CS
               START  79 aetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghs 162
                          +t++vis+g      galqlm+aelq lsplvp R++ f+R+++q+ +g+w++vdvS+d  ++ ++   +v +++lpSg+++++++ng+s
  Gorai.008G117200.2 405 TSTTDVISNGmggtrnGALQLMNAELQILSPLVPvREVSFLRFCKQHAEGVWAVVDVSIDTIKESTT---FVTCRRLPSGCVVQDMPNGYS 492
                         ************************************************************9999775...********************* PP

                         EEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
               START 163 kvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                         kv+wveh++++++++h+l+r+l++sg+ +ga++wvatlqrqce+
  Gorai.008G117200.2 493 KVIWVEHAEYDESQVHQLYRPLLSSGVGFGAQRWVATLQRQCEC 536
                         ******************************************96 PP
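
The Start and End columns of the Signature Domain table appear to be 1-based, inclusive residue positions on the protein, which is how they line up with the alignment rows above. A minimal sketch of cutting a domain out of the protein by those coordinates follows. The helper name `domain_slice` and its `offset` parameter are illustrative and not part of PlantTFDB; the fragment is residues 121-180 copied from the Protein Sequence block of this entry.

```python
def domain_slice(sequence: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the protein position of sequence[0], so a fragment of the
    protein can be sliced without loading the full sequence.
    (Hypothetical helper, written for illustration only.)
    """
    if start < offset or end < start or end - offset >= len(sequence):
        raise ValueError("coordinates fall outside the supplied sequence")
    return sequence[start - offset : end - offset + 1]


# Residues 121-180 of Gorai.008G117200.2 (third line of the sequence block).
fragment_121_180 = "PRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMKTQ"

# Homeobox domain: Start 123, End 178 in the Signature Domain table.
homeobox = domain_slice(fragment_121_180, 123, 178, offset=121)
print(homeobox)
```

The printed string should match the Gorai.008G117200.2 row of the Homeobox alignment (positions 123 to 178).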

Protein Features
Database          Entry ID           E-value     Start   End   InterPro ID   Description
SuperFamily       SSF46689           7.7E-21     102     180   IPR009057     Homeodomain-like
Gene3D            G3DSA:1.10.10.60   5.9E-22     109     180   IPR009057     Homeodomain-like
PROSITE profile   PS50071            17.216      120     180   IPR001356     Homeobox domain
SMART             SM00389            9.9E-18     121     184   IPR001356     Homeobox domain
CDD               cd00086            1.28E-18    122     180   No hit        No description
Pfam              PF00046            1.3E-18     123     178   IPR001356     Homeobox domain
PROSITE pattern   PS00027            0           155     178   IPR017970     Homeobox, conserved site
PROSITE profile   PS50848            44.133      306     539   IPR002913     START domain
SuperFamily       SSF55961           1.87E-32    308     536   No hit        No description
CDD               cd08875            3.37E-123   310     535   No hit        No description
Pfam              PF01852            7.8E-56     315     536   IPR002913     START domain
SMART             SM00234            1.1E-49     315     536   IPR002913     START domain
SuperFamily       SSF55961           6.7E-24     564     797   No hit        No description
Gene Ontology
GO Term       GO Category          GO Description
GO:0006355    Biological Process   regulation of transcription, DNA-templated
GO:0005634    Cellular Component   nucleus
GO:0008289    Molecular Function   lipid binding
GO:0043565    Molecular Function   sequence-specific DNA binding
Sequence
Protein Sequence    Length: 805 aa
MNFGGFIDDS SGPNEGLGGA RIVADVPYNT TTMPTGVFSQ PRLVSSSIPK NMFNSPGLSL  60
ALQQPNIDNQ GDETRLGENF EGSIGRRSRE EEHESRSGSD NMDGGSGDDH DPTTAAGDKP  120
PRKKRYHRHT PQQIQELEAL FKECPHPDEK QRLELSKRLC LETRQVKFWF QNRRTQMKTQ  180
LERHENSLLR QENDKLRAEN MSIRDAMRNP ICTNCGGPAI IGDMSLEEQH LRIENARLKD  240
ELDRVCALAG KFLGRPITGP PLPNSSLELG VGTNGTFGTT MATTTTLPLG HDALPTMVVP  300
SNRPATTLDR SMFLELALAA MDELVKMAQT DEPLWIKNIE GGREMLNHDE YLRTFTPCIG  360
LKPNGFVTEA SRETGVVIIN SLALVETLMD SNRWAEMFHC MIARTSTTDV ISNGMGGTRN  420
GALQLMNAEL QILSPLVPVR EVSFLRFCKQ HAEGVWAVVD VSIDTIKEST TFVTCRRLPS  480
GCVVQDMPNG YSKVIWVEHA EYDESQVHQL YRPLLSSGVG FGAQRWVATL QRQCECLAIL  540
MSSTVPTRDH TAITASGRRS MLKLAQRMTD NFCAGVCAST VHKWNKLNAG NVDEDVRVMT  600
RKSIDDPGEP PGIVLSAATS VWLPVSPQRL FDFLRDERLR SEWDILSNGG PMQEMAHIAK  660
GQDHGNCVSL LRASAMNANQ SSMLILQETC IDAAGSLVVY APVDIPAMHV VMNGGDSAYV  720
ALLPSGFAIV PDGPGSHGPI SNGHVNGNTG GGSSRVGGSL LTVAFQILVN SLPTAKLTVE  780
SVETVNNLIS CTVQKIKAAL QCES*
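
The sequence block above is laid out in groups of ten residues with a running position counter at the end of each full line, and it ends in `*`. A minimal sketch of turning that layout back into a plain residue string follows. The function name `parse_numbered_sequence` is hypothetical, and reading the trailing `*` as a translation-stop marker rather than a residue is an assumption. The sample uses only the first and last lines of the block, for brevity.

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Collapse a grouped, position-numbered sequence block into one string.

    Drops spaces, newlines, and the numeric counters, then removes a
    trailing '*' (assumed here to mark the translation stop).
    """
    residues = re.sub(r"[^A-Za-z*]", "", block)
    return residues.rstrip("*").upper()


# First and last lines of the block only (illustrative, not the full protein).
sample = """MNFGGFIDDS SGPNEGLGGA RIVADVPYNT TTMPTGVFSQ PRLVSSSIPK NMFNSPGLSL  60
SVETVNNLIS CTVQKIKAAL QCES*"""

protein = parse_numbered_sequence(sample)
print(len(protein), protein[:10], protein[-4:])
```

Running the same function on the whole block gives a string that can be written out as FASTA or indexed by the Start and End coordinates used elsewhere in this entry.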
Expression -- UniGene
UniGene ID   E-value   Expressed in
Gra.217      0.0       flower|seedling|flowering
Expression -- Description
Source    Description
UniProt   TISSUE SPECIFICITY: Expressed in roots, stems, leaves and floral buds. {ECO:0000269|PubMed:10402424}.
Functional Description
Source    Description
UniProt   Probable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Regulation -- PlantRegMap
Source        Upstream Regulator   Target Gene
PlantRegMap   Retrieve             -
Annotation -- Nucleotide
Source    Hit ID     E-value   Description
GenBank   EU583497   0.0       EU583497.1 Gossypium hirsutum homeodomain protein GL2-like 1 mRNA, complete cds.
Annotation -- Protein
Source      Hit ID               E-value   Description
Refseq      XP_012437660.1       0.0       PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
Swissprot   Q0WV12               0.0       ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBL      A0A0D2U1I8           0.0       A0A0D2U1I8_GOSRA; Uncharacterized protein
STRING      Gorai.008G117200.1   0.0       (Gossypium raimondii)
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT4G00730.1   0.0       HD-ZIP family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]