PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Araha.3868s0004.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis
Family HD-ZIP
Protein Properties Length: 804aa    MW: 87768.3 Da    PI: 6.6682
Description HD-ZIP family protein
Gene Model
Gene Model ID          Type      Source    Coding Sequence
Araha.3868s0004.1.p    genome    JGI       View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      Homeobox    62.2     7.6e-20    112      167    1            56
                          TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
             Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                          +++ +++t++q++ Le++F+++ +p++++r +L+++l+L+ rqVk+WFqNrR+++k
  Araha.3868s0004.1.p 112 KKRYHRHTPKQIQDLESVFKECAHPDEKQRLDLSRRLNLDPRQVKFWFQNRRTQMK 167
                          688999***********************************************999 PP

2      START       180.1    1.3e-56    312      531    2            206
                          HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....E CS
                START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....k 78 
                          la++a++elvk+a+ ++p+W +ss    e +n++e+ ++f++  +     + +ea+++ g v+ ++  lve+l+d+  +W e+++    +
  Araha.3868s0004.1.p 312 LALAAMDELVKMAQTRDPLWARSSdtgiEVLNQEEYDTSFTRCVGpkpdgFVSEASKEAGTVIINSLALVETLMDSE-RWAEMFPsmisR 400
                          6899********************999966666666666644333667889**************************.*******9999* PP

                          EEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCE CS
                START  79 aetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksngh 161
                           +t+e issg      gal+lm aelq+lsplvp R + f+R+++q+ +g+w++vdvS+ds ++ + sss+ R   lpSg+l+++++ng 
  Araha.3868s0004.1.p 401 TSTTEIISSGmggsrnGALHLMHAELQLLSPLVPvRQVSFLRFCKQHAEGVWAVVDVSIDSIREGS-SSSCRR---LPSGCLVQDMANGC 486
                          *****************************************************************9.777766...************** PP

                          EEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                START 162 skvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                          skvtw+eh++++++++h l+r+l++ gla+ga +w+a+lqrqce+
  Araha.3868s0004.1.p 487 SKVTWIEHTEYDENHIHRLYRPLLSCGLAFGAHRWMAALQRQCEC 531
                          *******************************************96 PP
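
The query rows in the alignments above can be collapsed back into a plain residue string. A minimal Python sketch, assuming that `-` and `.` are gap characters and that lowercase letters in the query row are real residues sitting in insert positions; the page itself does not state this convention, but it is consistent with the reported coordinates (the first START row spans 312 to 400, i.e. 89 residues). The function name `ungap_query_row` is illustrative only.

```python
def ungap_query_row(row: str) -> str:
    """Recover the ungapped query residues from one alignment row.

    Assumption: '-' and '.' mark gaps, and lowercase letters are query
    residues in insert positions, so they are kept and upper-cased.
    """
    return "".join(ch.upper() for ch in row if ch.isalpha())


# Query row for positions 312-400 of the START alignment, copied from the record.
row_312_400 = (
    "LALAAMDELVKMAQTRDPLWARSSdtgiEVLNQEEYDTSFTRCVGpkpdg"
    "FVSEASKEAGTVIINSLALVETLMDSE-RWAEMFPsmisR"
)

segment = ungap_query_row(row_312_400)
print(len(segment), segment)
```

Concatenating the ungapped rows of all three START blocks in order should reproduce the query segment for the whole domain.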

Protein Features
3D Structure
Database           Entry ID            E-value      Start    End    InterPro ID    Description
Gene3D             G3DSA:1.10.10.60    2.2E-20      86       170    IPR009057      Homeodomain-like
SuperFamily        SSF46689            2.88E-19     100      169    IPR009057      Homeodomain-like
PROSITE profile    PS50071             17.086       109      169    IPR001356      Homeobox domain
SMART              SM00389             1.9E-18      110      173    IPR001356      Homeobox domain
CDD                cd00086             1.13E-17     112      169    No hit         No description
Pfam               PF00046             1.8E-17      112      167    IPR001356      Homeobox domain
PROSITE pattern    PS00027             0            144      167    IPR017970      Homeobox, conserved site
PROSITE profile    PS50848             39.771       302      534    IPR002913      START domain
SuperFamily        SSF55961            1.65E-30     305      531    No hit         No description
CDD                cd08875             1.99E-111    306      530    No hit         No description
SMART              SM00234             4.1E-44      311      531    IPR002913      START domain
Pfam               PF01852             5.8E-49      312      531    IPR002913      START domain
SuperFamily        SSF55961            1.18E-17     560      795    No hit         No description
Gene Ontology
GO Term       GO Category           GO Description
GO:0006355    Biological Process    regulation of transcription, DNA-templated
GO:0048497    Biological Process    maintenance of floral organ identity
GO:0008289    Molecular Function    lipid binding
GO:0043565    Molecular Function    sequence-specific DNA binding
Sequence
Protein Sequence    Length: 804 aa
MNFNGFLDDG AGASKLLSDV PYNNHFSFSA VDTTMLGTTA IAPPHSRPFS SSGLSLGLQT  60
NGEMSRNGEI FESNITRKSS RGEDVESRSE SDNAEAVSGD DLDTSDRPLK KKKRYHRHTP  120
KQIQDLESVF KECAHPDEKQ RLDLSRRLNL DPRQVKFWFQ NRRTQMKTQI ERHENALLRQ  180
ENDKLRAENM SVREAMRNPM CGNCGGPAVI GEISMEEQHL RIENSRLKDE LDRVCALTGK  240
FLGRSNGSHH IPDSALVLGV GGGFTLSSPV LPQASPRFEI SNATGSGFVA TVNRQPPVGV  300
SDFDQRSRYL DLALAAMDEL VKMAQTRDPL WARSSDTGIE VLNQEEYDTS FTRCVGPKPD  360
GFVSEASKEA GTVIINSLAL VETLMDSERW AEMFPSMISR TSTTEIISSG MGGSRNGALH  420
LMHAELQLLS PLVPVRQVSF LRFCKQHAEG VWAVVDVSID SIREGSSSSC RRLPSGCLVQ  480
DMANGCSKVT WIEHTEYDEN HIHRLYRPLL SCGLAFGAHR WMAALQRQCE CLTILMSSTV  540
SPSPNPTPIN CNGRKSMLKL AKRMTDNFCG GVCASSLQKW SKLNVGNVDE DVRIMTRKSV  600
NNPGEPPGII LNAATSVWMP VSPRRLFDFL GNERLRSEWD ILSNGGPMKE MAHIAKGHDH  660
SNSVSLLRAS AINANQSSML ILQETSIDAA GALVVYAPVD IPAMQAVMNG GDSAYVALLP  720
SGFAILPNAQ AGTQRCAAEE RNSNGNGNGG CMEEGGSLLT VAFQILVNSL PTAKLTVESV  780
ETVNNLISCT VQKIKAALHC DST*
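
The numbered block above has to be flattened before the Start/End coordinates from the domain tables can be applied to it. A minimal Python sketch, assuming the layout shown here (space-separated groups of ten residues, a running position counter at the end of each line, and a trailing `*` for the stop codon) and assuming the coordinates are 1-based and inclusive. The names `parse_sequence_block` and `slice_domain` are illustrative, not part of PlantTFDB. Only the first three lines of the record are embedded to keep the example short.

```python
def parse_sequence_block(block: str) -> str:
    """Join a numbered sequence block into one residue string.

    Assumption: each line holds space-separated residue groups followed by
    an optional position counter, and a trailing '*' marks the stop codon.
    """
    residues = []
    for line in block.splitlines():
        for token in line.split():
            if token.isdigit():  # running position counter, e.g. "60"
                continue
            residues.append(token.rstrip("*"))
    return "".join(residues)


def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# First three lines of the record's sequence block (residues 1-180).
SAMPLE_BLOCK = """\
MNFNGFLDDG AGASKLLSDV PYNNHFSFSA VDTTMLGTTA IAPPHSRPFS SSGLSLGLQT  60
NGEMSRNGEI FESNITRKSS RGEDVESRSE SDNAEAVSGD DLDTSDRPLK KKKRYHRHTP  120
KQIQDLESVF KECAHPDEKQ RLDLSRRLNL DPRQVKFWFQ NRRTQMKTQI ERHENALLRQ  180
"""

sequence = parse_sequence_block(SAMPLE_BLOCK)
# Start/End of the Homeobox hit, taken from the Signature Domain table.
homeobox = slice_domain(sequence, 112, 167)
print(len(sequence), homeobox)
```

The sliced segment can be compared with the query row of the Homeobox alignment as a sanity check on the coordinate convention.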
Functional Description
Source     Description
UniProt    Probable transcription factor. {ECO:0000250}.
Binding Motif
Motif ID    Method    Source                     Motif file
MP00418     DAP       Transfer from AT3G61150    Download
Motif logo
Cis-element
Source         Link
PlantRegMap    Araha.3868s0004.1.p
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              Retrieve
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    AY050866    0.0        AY050866.1 Arabidopsis thaliana putative homeobox protein (At3g61150) mRNA, complete cds.
GenBank    AY096757    0.0        AY096757.1 Arabidopsis thaliana putative homeobox protein (At3g61150) mRNA, complete cds.
Annotation -- Protein
Source       Hit ID                            E-value    Description
Refseq       XP_002876596.1                    0.0        homeobox-leucine zipper protein HDG1 isoform X1
Swissprot    Q9M2E8                            0.0        HDG1_ARATH; Homeobox-leucine zipper protein HDG1
TrEMBL       D7LS63                            0.0        D7LS63_ARALL; Uncharacterized protein
STRING       fgenesh1_pm.C_scaffold_5002090    0.0        (Arabidopsis lyrata)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Malvids    OGEM1128                27             105
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT3G61150.1    0.0        homeodomain GLABROUS 1
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Horstman A, et al.
    AIL and HDG proteins act antagonistically to control cell proliferation.
    Development, 2015. 142(3): p. 454-64
    [PMID:25564655]