PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Araha.33550s0002.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis
Family HD-ZIP
Protein Properties Length: 604aa    MW: 67141.3 Da    PI: 5.8993
Description HD-ZIP family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
Araha.33550s0002.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  59.5   5.6e-19  59     113  2          56
                           T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                            k +++t++q++eLe++F+++++p++++r eL kkl L+ +q+k+WFqNrR+++k
  Araha.33550s0002.1.p  59 TKYHRHTSYQIQELESFFKECPHPNEKQRLELGKKLTLESKQIKFWFQNRRTQMK 113
                           678899**********************************************999 PP

2    START     153.3  2e-48    225    426  3          206
                           HHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECT CS
                 START   3 aeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetla....kaetleviss 87 
                           a ea++el+k+a+++  +W+       ++e++  +          +r++g+v  ++  lve+l+d++ +W e+++      +t+evis+
  Araha.33550s0002.1.p 225 AMEAMDELLKLAELDNLLWSAKI----EKESMNHL--------AGSRETGLVLINSLALVETLMDTN-KWAEMFEcivaVGSTVEVISN 300
                           56788888888888888886655....66666666........569*********************.**********9********** PP

                           T......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE- CS
                 START  88 g......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwveh 169
                           g      g lqlm+ae+q++splvp +   f+Ry++q+g+g w++vdvS d +++++  +s+ +++++pSg++i++ +ng skvtw+e 
  Araha.33550s0002.1.p 301 GtdgsrsGSLQLMQAEFQVMSPLVPiKQKKFLRYCKQHGDGLWAVVDVSYDINRENEYLKSYGCSKKFPSGCIIQDIGNGCSKVTWIEY 389
                           ********************************************************99******************************* PP

                           EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                 START 170 vdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                            +++++++h+l+++l++s++  ga +w+atlqrqce+
  Araha.33550s0002.1.p 390 SEYEESHIHSLYQPLLSSSVGLGATKWLATLQRQCES 426
                           ***********************************95 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  6.0E-20   44     115  IPR009057    Homeodomain-like
SuperFamily      SSF46689          1.21E-18  44     115  IPR009057    Homeodomain-like
SMART            SM00389           5.2E-14   53     119  IPR001356    Homeobox domain
PROSITE profile  PS50071           16.973    55     115  IPR001356    Homeobox domain
Pfam             PF00046           9.4E-17   59     113  IPR001356    Homeobox domain
CDD              cd00086           8.58E-16  62     115  No hit       No description
PROSITE profile  PS50848           39.207    214    429  IPR002913    START domain
SuperFamily      SSF55961          1.79E-28  220    427  No hit       No description
CDD              cd08875           4.36E-99  220    425  No hit       No description
SMART            SM00234           3.8E-35   223    426  IPR002913    START domain
Pfam             PF01852           4.4E-42   225    426  IPR002913    START domain
SuperFamily      SSF55961          8.06E-7   486    595  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 604 aa
MNGDLDVDMS RGDFNPSYFL GKLKDDEFES RSLSDDSFDA LSGDEDKQEQ RPKKKKRKTK  60
YHRHTSYQIQ ELESFFKECP HPNEKQRLEL GKKLTLESKQ IKFWFQNRRT QMKTQLERHE  120
NVILKQENEK LRLENSFLKE SMRGSLCIDC GGAVIPGEVS FEQHQLRIEN AKLKDELDRI  180
CALANRFIGG SISLEQPSNG GIGSQHLPIG HSVSGGTSLM FMDLAMEAMD ELLKLAELDN  240
LLWSAKIEKE SMNHLAGSRE TGLVLINSLA LVETLMDTNK WAEMFECIVA VGSTVEVISN  300
GTDGSRSGSL QLMQAEFQVM SPLVPIKQKK FLRYCKQHGD GLWAVVDVSY DINRENEYLK  360
SYGCSKKFPS GCIIQDIGNG CSKVTWIEYS EYEESHIHSL YQPLLSSSVG LGATKWLATL  420
QRQCESFTTL FSSQDHTGLS LAGTKSILKL AQRMKLNFYS GITASSIHKW EKLLAENGKD  480
QNESSMLILQ ETWNDASGAL VVYAPVDIPS MNVVMSGGDS AYVALLPSGF SILPDGSSSS  540
SDQFDTDGGL VNHESKGCLL TVGFQILVNS LPTAKLNVES VETVNNLIAC TIHKIRAALR  600
IPA*
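
The domain coordinates in the Signature Domain table can be checked against the sequence block above. The following is a minimal Python sketch, not part of the database: the helper names `parse_sequence_block` and `domain_slice` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, as the alignment numbering suggests. Under this parsing, the stated length of 604 appears to include the trailing `*` (stop symbol), leaving 603 residues.

```python
import re

# Sequence block copied from this record, including the grouping spaces
# and the running position counts at the end of each line.
RAW_BLOCK = """
MNGDLDVDMS RGDFNPSYFL GKLKDDEFES RSLSDDSFDA LSGDEDKQEQ RPKKKKRKTK  60
YHRHTSYQIQ ELESFFKECP HPNEKQRLEL GKKLTLESKQ IKFWFQNRRT QMKTQLERHE  120
NVILKQENEK LRLENSFLKE SMRGSLCIDC GGAVIPGEVS FEQHQLRIEN AKLKDELDRI  180
CALANRFIGG SISLEQPSNG GIGSQHLPIG HSVSGGTSLM FMDLAMEAMD ELLKLAELDN  240
LLWSAKIEKE SMNHLAGSRE TGLVLINSLA LVETLMDTNK WAEMFECIVA VGSTVEVISN  300
GTDGSRSGSL QLMQAEFQVM SPLVPIKQKK FLRYCKQHGD GLWAVVDVSY DINRENEYLK  360
SYGCSKKFPS GCIIQDIGNG CSKVTWIEYS EYEESHIHSL YQPLLSSSVG LGATKWLATL  420
QRQCESFTTL FSSQDHTGLS LAGTKSILKL AQRMKLNFYS GITASSIHKW EKLLAENGKD  480
QNESSMLILQ ETWNDASGAL VVYAPVDIPS MNVVMSGGDS AYVALLPSGF SILPDGSSSS  540
SDQFDTDGGL VNHESKGCLL TVGFQILVNS LPTAKLNVES VETVNNLIAC TIHKIRAALR  600
IPA*
"""


def parse_sequence_block(block: str) -> str:
    """Drop whitespace and position counters; keep letters and '*'."""
    return re.sub(r"[^A-Za-z*]", "", block).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end (assumed 1-based, inclusive)."""
    return seq[start - 1:end]


sequence = parse_sequence_block(RAW_BLOCK)   # includes the trailing '*'
residues = sequence.rstrip("*")              # amino acids only

# Coordinates taken from the Signature Domain table of this record.
homeobox = domain_slice(residues, 59, 113)
start_domain = domain_slice(residues, 225, 426)

print(len(sequence), len(residues))
print(homeobox)
print(start_domain[:10], "...", start_domain[-10:])
```

The Homeobox slice should reproduce the query line of the first alignment (TKYHRHTSYQ...RRTQMK), and the START slice should begin and end with the same residues as the second alignment.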
Nucleic Localization Signal
NLS
No.  Start  End  Sequence
1    49     57   QRPKKKKRK
2    52     57   KKKKRK
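
The NLS coordinates can be cross-checked against the first 60 residues of the protein sequence in this record. The sketch below is an illustration only (the helper `locate` is hypothetical). It suggests that the listed Start/End values behave like 0-based, inclusive positions, whereas the Signature Domain table uses 1-based positions. This is an observation from the data shown here, not documented database behavior.

```python
# First 60 residues of the protein sequence from this record.
N_TERMINUS = "MNGDLDVDMSRGDFNPSYFLGKLKDDEFESRSLSDDSFDALSGDEDKQEQRPKKKKRKTK"

# (start, end, motif) rows as listed in the NLS table.
NLS_ROWS = [(49, 57, "QRPKKKKRK"), (52, 57, "KKKKRK")]


def locate(seq: str, motif: str) -> tuple[int, int]:
    """Return the 0-based, inclusive (start, end) of the first match."""
    start = seq.find(motif)
    if start < 0:
        raise ValueError(f"{motif} not found")
    return start, start + len(motif) - 1


for listed_start, listed_end, motif in NLS_ROWS:
    zero_start, zero_end = locate(N_TERMINUS, motif)
    print(
        motif,
        "listed:", (listed_start, listed_end),
        "0-based:", (zero_start, zero_end),
        "1-based:", (zero_start + 1, zero_end + 1),
    )
```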
Functional Description
Source   Description
UniProt  Probable transcription factor that binds to the DNA sequence 5'-GCATTAAATGC-3'. {ECO:0000269|PubMed:16778018}.
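
The binding sequence quoted above has a two-fold symmetry that is easy to verify. A minimal Python sketch (the function name `reverse_complement` is an illustrative helper, not a database or UniProt API) compares the 11-nt site with its reverse complement and checks that the two 5-nt half-sites are reverse complements of each other.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an A/C/G/T string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


SITE = "GCATTAAATGC"  # from the UniProt description in this record

rc = reverse_complement(SITE)

# 0-based positions where the site differs from its reverse complement.
mismatches = [i for i, (a, b) in enumerate(zip(SITE, rc)) if a != b]

# Split the odd-length site around its central base.
half = len(SITE) // 2
left, right = SITE[:half], SITE[half + 1:]

print(rc)                                   # GCATTTAATGC
print(mismatches)                           # [5]
print(left, right, reverse_complement(left) == right)
```

Only the central base differs, which is unavoidable for an odd-length site, so the sequence is a pseudopalindrome with half-sites GCATT and AATGC.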
Binding Motif
Motif ID  Method  Source                   Motif file
MP00559   DAP     Transfer from AT5G52170  Download
Motif logo
Cis-element
Source       Link
PlantRegMap  Araha.33550s0002.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AB025603  0.0      AB025603.1 Arabidopsis thaliana genomic DNA, chromosome 5, BAC clone:F17P19.
GenBank  CP002688  0.0      CP002688.1 Arabidopsis thaliana chromosome 5 sequence.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_020871109.1  0.0      homeobox-leucine zipper protein HDG7
Swissprot  Q9LTK3          0.0      HDG7_ARATH; Homeobox-leucine zipper protein HDG7
TrEMBL     A0A178URB7      0.0      A0A178URB7_ARATH; HDG7
STRING     AT5G52170.1     0.0      (Arabidopsis thaliana)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM1128              27           105
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G52170.1  0.0      homeodomain GLABROUS 7
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]