PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID AA55G00037
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Aethionemeae; Aethionema
Family HD-ZIP
Protein Properties Length: 676aa    MW: 75317 Da    PI: 5.8115
Description HD-ZIP family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
AA55G00037       genome    VEGI      View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  62.1   8.2e-20  47     101  2          56
                 T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
    Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                 +k +++t++q++eLe++F ++++p++++r eL +klgL+ +q+k+WFqNrR+++k
  AA55G00037  47 SKYHRHTSYQIQELESVFGECPHPNEKQRLELGRKLGLESTQIKFWFQNRRTQMK 101
                 67889***********************************************999 PP

2    START     134.9  8.9e-43  222    407  46         205
                 EEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECTT...........................EEEEEEEEXXTTXX-SSX.EEEEEE CS
       START  46 ealrasgvvdmvlallveellddkeqWdetla....kaetlevissg...........................galqlmvaelqalsplvp.Rdfvfv 112
                 e +r++g+v   + +lve+l+++k qW e++a     a+t+evis+g                             l +m+a++q++splvp R++ f+
  AA55G00037 222 ECSRETGLVSISSLDLVETLMETK-QWAEMFAcivaVASTIEVISNGsngtrsgslhlvcssdtkdlifsqetdPLLGQMQAKFQVMSPLVPiREVKFL 319
                 679*********************.***********************************************9999*********************** PP

                 EEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
       START 113 RyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                 Ry++ql++  w++vdvS d++++ ++      +++lpSg++i++++ng skvtw+e+ +++++ +h l+r l++s +  ga +w+atlqr c+
  AA55G00037 320 RYCKQLKESFWAVVDVSYDVKKEDEK-----WCRRLPSGCIIQDMGNGCSKVTWIEQSEYDESYVHPLYRTLLSSAVGLGATRWLATLQRECQ 407
                 ***********************996.....6679********************************************************98 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SuperFamily      SSF46689          1.71E-19  31     103  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  2.9E-21   33     103  IPR009057    Homeodomain-like
PROSITE profile  PS50071           16.925    43     103  IPR001356    Homeobox domain
SMART            SM00389           3.5E-17   44     107  IPR001356    Homeobox domain
CDD              cd00086           1.01E-16  46     103  No hit       No description
Pfam             PF00046           1.6E-17   47     101  IPR001356    Homeobox domain
SMART            SM00234           3.1E-23   177    408  IPR002913    START domain
CDD              cd08875           1.55E-83  196    407  No hit       No description
SuperFamily      SSF55961          9.61E-25  196    269  No hit       No description
Pfam             PF01852           2.6E-36   222    407  IPR002913    START domain
PROSITE profile  PS50848           30.827    225    411  IPR002913    START domain
SuperFamily      SSF55961          9.61E-25  299    409  No hit       No description
SuperFamily      SSF55961          2.88E-17  436    669  No hit       No description
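Several databases report overlapping hits for the same region of the protein. As a rough illustration only, the sketch below merges the Start/End coordinates from the table above into non-overlapping regions. The names `merge_intervals`, `HITS`, and `REGIONS` are hypothetical and not part of PlantTFDB; merging overlaps is an assumption made here for summarizing the hits, not a method the database describes.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) pairs; coordinates are 1-based, inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Start/End columns copied from the Protein Features table.
HITS = [
    (31, 103), (33, 103), (43, 103), (44, 107), (46, 103), (47, 101),
    (177, 408), (196, 407), (196, 269), (222, 407), (225, 411), (299, 409),
    (436, 669),
]

REGIONS = merge_intervals(HITS)
print(REGIONS)
```

Under this assumption the hits collapse into three regions: one covering the homeobox hits, one covering the START hits, and one C-terminal SuperFamily hit.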
Gene Ontology
GO Term     GO Category         GO Description
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 676 aa
MKVLASPAKE KMKGNGDLNL KDDEFDSFDG AMSGDDKEEE APRKKKSKYH RHTSYQIQEL  60
ESVFGECPHP NEKQRLELGR KLGLESTQIK FWFQNRRTQM KTQLERHENV ILRQENEKLR  120
VENRYLKEAM RVPICSDCTG EVAPCEVSYD HQLKIENSKL KEELDRICTL TNRYIGSQNL  180
PVRPDFNGGS VMYESSMFME LAVKAMDELI KLADPGFVIV PECSRETGLV SISSLDLVET  240
LMETKQWAEM FACIVAVAST IEVISNGSNG TRSGSLHLVC SSDTKDLIFS QETDPLLGQM  300
QAKFQVMSPL VPIREVKFLR YCKQLKESFW AVVDVSYDVK KEDEKWCRRL PSGCIIQDMG  360
NGCSKVTWIE QSEYDESYVH PLYRTLLSSA VGLGATRWLA TLQRECQSLT TLFSCLNPVQ  420
DFAGLSPAGT KSALKLAKRM TFNFYSGITA SSSRRWEKLL AGNVGEDTRI LTRKNIDDPG  480
EPYGVILSAA TSLWLPVSHQ KLFDFLRDEK HRHHWDILCN GALMDDMLLV PKGERDGSCC  540
VSLLRAAGKD KTENSMLILQ ETWIDTSGAL VVYAPVDVSS MNGVMNGGDT ACVALLPSGF  600
SILPDGSSSS DQICSNGGLV HPESKENSSG GSLLTVGFQI LVNSLPTAIL NLDSLDTVNN  660
LISCTIHKIR SALHVP
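The sequence block uses groups of ten residues with a running position counter at the end of each line, and the Signature Domain table gives 1-based, inclusive Start/End coordinates. A minimal sketch of how such a block could be cleaned and a domain sliced out follows. The helper names `parse_numbered_block` and `slice_region` are hypothetical, and only the first two lines of the sequence are used, which is enough to cover the homeobox domain at positions 47 to 101.

```python
import re


def parse_numbered_block(text):
    """Drop position counters and whitespace, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def slice_region(seq, start, end):
    """Return residues start..end, using 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# First two lines of the protein sequence block (residues 1-120).
BLOCK = """
MKVLASPAKE KMKGNGDLNL KDDEFDSFDG AMSGDDKEEE APRKKKSKYH RHTSYQIQEL  60
ESVFGECPHP NEKQRLELGR KLGLESTQIK FWFQNRRTQM KTQLERHENV ILRQENEKLR  120
"""

prefix = parse_numbered_block(BLOCK)
homeobox = slice_region(prefix, 47, 101)
print(homeobox)
```

The extracted segment can be compared against the AA55G00037 row of the Homeobox alignment in the Signature Domain section as a consistency check on the coordinates.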
Functional Description
Source   Description
UniProt  Probable transcription factor that binds to the DNA sequence 5'-GCATTAAATGC-3'. {ECO:0000269|PubMed:16778018}.
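To illustrate how the binding sequence quoted above might be located in a promoter, the sketch below scans a DNA string for exact matches on both strands. The function names and the test sequences are hypothetical, and exact string matching is an assumption made for simplicity; real binding-site prediction would normally use a position weight matrix such as the motif listed under Binding Motif.

```python
MOTIF = "GCATTAAATGC"  # binding sequence from the UniProt description

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an A/C/G/T string."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def find_motif(dna, motif=MOTIF):
    """Return sorted (1-based start, strand) pairs for exact motif matches."""
    dna = dna.upper()
    hits = set()
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = dna.find(pattern)
        while start != -1:
            hits.add((start + 1, strand))
            # Step one base forward so overlapping matches are also found.
            start = dna.find(pattern, start + 1)
    return sorted(hits)


print(find_motif("TTGCATTAAATGCAA"))
```

The 11-mer is not identical to its own reverse complement (the central base differs), so both strands are searched separately.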
Binding Motif
Motif ID  Method  Source                   Motif file
MP00559   DAP     Transfer from AT5G52170  Download
Cis-element
Source       Link
PlantRegMap  AA55G00037
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_019091179.1  0.0      PREDICTED: homeobox-leucine zipper protein HDG7-like isoform X2
Swissprot  Q9LTK3          0.0      HDG7_ARATH; Homeobox-leucine zipper protein HDG7
TrEMBL     A0A1P8BG00      0.0      A0A1P8BG00_ARATH; Homeodomain GLABROUS 7
STRING     XP_010444530.1  0.0      (Camelina sativa)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM1128              27           105
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G52170.1  0.0      homeodomain GLABROUS 7
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]