PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_D12G1049
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 814aa    MW: 89057.1 Da    PI: 5.8354
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_D12G1049genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox66.24.5e-21122177156
                  TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
     Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                  +++ +++t++q++eLe+lF+++++p++++r eL+k+l L++rqVk+WFqNrR+++k
  Gh_D12G1049 122 KKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMK 177
                  688999***********************************************999 PP

2START204.25.3e-643145351206
                  HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEE CS
        START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevi 85 
                  ela++a++elvk+a+ +ep+W k      e++n de+l++f++  +     + +ea+r++gvv+ ++  lve+l+d++ +W e++     + +t++vi
  Gh_D12G1049 314 ELALAAMDELVKMAQTDEPLWIKNIeggrEMLNHDEYLRTFTPCIGlkpngFVTEASRETGVVIINSLALVETLMDSN-RWAEMFHcmiaRTSTTDVI 410
                  5899**************************************99999***9***************************.******************* PP

                  CTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSX CS
        START  86 ssg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrl 176
                  s+g      galqlm+aelq lsplvp R++ f+R+++q+ +g+w++vdvSvd  ++ ++   +v +++lpSg+++++++ng+skv+wveh+++++++
  Gh_D12G1049 411 SNGmggtrnGALQLMNAELQILSPLVPvREVSFLRFCKQHAEGVWAVVDVSVDTIKESTT---FVTCRRLPSGCVVQDMPNGYSKVIWVEHAEYDESQ 505
                  *****************************************************9999775...*********************************** PP

                  XHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
        START 177 phwllrslvksglaegaktwvatlqrqcek 206
                  +h+l+r+l++sg+ +ga++wva+lqrqce+
  Gh_D12G1049 506 VHQLYRPLLSSGVGFGAQRWVAALQRQCEC 535
                  ****************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466897.7E-21101179IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.606.0E-22108179IPR009057Homeodomain-like
PROSITE profilePS5007117.216119179IPR001356Homeobox domain
SMARTSM003899.9E-18120183IPR001356Homeobox domain
CDDcd000861.30E-18121179No hitNo description
PfamPF000461.4E-18122177IPR001356Homeobox domain
PROSITE patternPS000270154177IPR017970Homeobox, conserved site
PROSITE profilePS5084844.01305538IPR002913START domain
SuperFamilySSF559611.54E-32307535No hitNo description
CDDcd088757.60E-122309534No hitNo description
SMARTSM002344.9E-50314535IPR002913START domain
PfamPF018525.0E-56314535IPR002913START domain
SuperFamilySSF559617.14E-24563796No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 814 aa     Download sequence    Send to blast
MNFGGFIDDS SGPNDGLGGA RIVADVPYNT PTMPTGVFSQ PRLVSSSIPK NMFNSPGLSL  60
ALQPNIDNQG DETRLGENFE GSIGRRSREE EHESRSGSDN MDGGSGDDHD PTTAAGDKPP  120
RKKRYHRHTP QQIQELEALF KECPHPDEKQ RLELSKRLCL ETRQVKFWFQ NRRTQMKTQL  180
ERHENSLLRQ ENDKLRAENM SIRDAMRNPI CTNCGGPAII GDMSLEEQLL RIENARLKDE  240
LDRVCALAGK FLGRPITGPP LPNSSLELGV GTNGTFGTTM ATTTTLPLGH DALPTMVVPS  300
NRPATTLDRS MFLELALAAM DELVKMAQTD EPLWIKNIEG GREMLNHDEY LRTFTPCIGL  360
KPNGFVTEAS RETGVVIINS LALVETLMDS NRWAEMFHCM IARTSTTDVI SNGMGGTRNG  420
ALQLMNAELQ ILSPLVPVRE VSFLRFCKQH AEGVWAVVDV SVDTIKESTT FVTCRRLPSG  480
CVVQDMPNGY SKVIWVEHAE YDESQVHQLY RPLLSSGVGF GAQRWVAALQ RQCECLAILM  540
SSTVPTRDHT AITASGRRSM LKLAQRMTDN FCAGVCASTV HKWNKLNAGN VDEDVRVMTR  600
KSIDDPGEPP GIVLSAATSV WLPVSPQRLF DFLRDERLRS EWDILSNGGP MQEMAHIAKG  660
QDHGNCVSLL RASAMNANQS SMLILQETCI DAAGSLVVYA PVDIPAMHVV MNGGDSAYVA  720
LLPSGFAIVP DGPRSHGPIS NGHVNGNTGG GSSSVGGSLL TVAFQILVNS LPTAKLTVES  780
VETVNNLISC TVQKIKAALQ CEKCDSVSWD EWGY
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Ghi.20200.0boll| leaf| ovule
Expression -- Description ? help Back to Top
Source Description
UniprotTISSUE SPECIFICITY: Expressed in roots, stems, leaves and floral buds. {ECO:0000269|PubMed:10402424}.
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankEU5834970.0EU583497.1 Gossypium hirsutum homeodomain protein GL2-like 1 mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016735217.10.0PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X2
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLA0A0D2PT370.0A0A0D2PT37_GOSRA; Uncharacterized protein
STRINGGorai.008G117200.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM112827105
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]