PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sopen02g024970.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family HD-ZIP
Protein Properties Length: 730 aa    MW: 79970.4 Da    pI: 5.4306
Description HD-ZIP family protein
Gene Model
Gene Model ID      Type    Source  Coding Sequence
Sopen02g024970.1   genome  spenn   View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  58.6   1e-18    60     115  1          56
                       TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       +++ +++t++q++e+e++++++++p+ ++r+eL ++lgL+  qVk+WFqN+R+++k
  Sopen02g024970.1  60 KKRYHRHTQHQIQEMESFYKECNHPDDKQRKELGRRLGLEPLQVKFWFQNKRTQMK 115
                       688999***********************************************998 PP

2    START     216.8  7.5e-68  250    468  1          206
                       HHHHHHHHHHHHHHHHC-TT-EEEE..EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEE CS
             START   1 elaeeaaqelvkkalaeepgWvkss..esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetl 82 
                       ela +a++el+++a+ eep+W  ss  e + ++e+ ++f+++ +      ++ea+r+s+vv+m++ +lve+l+d++ qW++ +a    ka+tl
  Sopen02g024970.1 250 ELAVSAMEELIRMAQTEEPLWLPSSgsETLCEQEYARIFPRGLGpkpatLNSEASRESAVVIMNHINLVEILMDVN-QWTTVFAglvsKAMTL 341
                       57899********************999999********99888********************************.**************** PP

                       EEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE CS
             START  83 evissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwve 168
                       ev+s+g      galq+m+ae+q++splvp R+ +f+Ry++q+g+g+wv+vdvS+d+ + ++    v R++++pSg+li++++ng+s+v+wve
  Sopen02g024970.1 342 EVLSTGvagnhnGALQVMTAEFQVPSPLVPtRENYFLRYCKQHGEGTWVVVDVSLDNLRTVS----VPRCRRRPSGCLIQEMPNGYSRVIWVE 430
                       ************************************************************97....8************************** PP

                       -EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
             START 169 hvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                       hv+++++ +h ++++lv+sg+a+gak+wvatl+rqce+
  Sopen02g024970.1 431 HVEVDENAVHDIYKPLVNSGIAFGAKRWVATLDRQCER 468
                       ************************************97 PP
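
The alignment blocks above use the HMMER text layout: the profile consensus on top, a match line in the middle, and the query sequence below. The following is a minimal sketch, assuming the usual HMMER convention that a letter on the match line marks a residue identical to the consensus, "+" marks a conservative substitution, and a blank marks a non-conserved column. It summarizes the Homeobox block; the function name `summarize_match_line` is illustrative and not part of PlantTFDB or HMMER.

```python
# Rows copied from the Homeobox alignment block of this record.
consensus = "rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek"
match     = "+++ +++t++q++e+e++++++++p+ ++r+eL ++lgL+  qVk+WFqN+R+++k"
query     = "KKRYHRHTQHQIQEMESFYKECNHPDDKQRKELGRRLGLEPLQVKFWFQNKRTQMK"


def summarize_match_line(match_line):
    """Count identical (letter), conservative ('+') and unmatched (' ') columns."""
    identical = sum(ch.isalpha() for ch in match_line)
    conservative = match_line.count("+")
    unmatched = match_line.count(" ")
    return identical, conservative, unmatched


# All three rows must describe the same number of alignment columns.
assert len(consensus) == len(match) == len(query)

identical, conservative, unmatched = summarize_match_line(match)
print(f"identical={identical} conservative={conservative} unmatched={unmatched}")
print(f"identity over {len(match)} columns: {identical / len(match):.1%}")
```

The same function can be applied to each START block, provided the three rows of a block are copied with their gap characters intact so that the columns stay aligned.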

Protein Features
Database         Entry ID           E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   2.1E-21    44     119  IPR009057    Homeodomain-like
SuperFamily      SSF46689           2.21E-18   45     117  IPR009057    Homeodomain-like
PROSITE profile  PS50071            15.71      57     117  IPR001356    Homeobox domain
SMART            SM00389            3.0E-16    58     121  IPR001356    Homeobox domain
Pfam             PF00046            2.0E-16    60     115  IPR001356    Homeobox domain
CDD              cd00086            1.95E-16   60     118  No hit       No description
PROSITE profile  PS50848            47.342     241    471  IPR002913    START domain
SuperFamily      SSF55961           9.62E-37   242    470  No hit       No description
CDD              cd08875            9.16E-126  245    467  No hit       No description
SMART            SM00234            8.2E-63    250    468  IPR002913    START domain
Pfam             PF01852            2.1E-57    251    468  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  2.1E-5     317    468  IPR023393    START-like domain
SuperFamily      SSF55961           1.92E-24   488    719  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 730 aa
MFNSHQHLLD ISSSAQRTPD NELDFIRDEE FDSNSGADNM EAPNSGDDDQ ADPNQPPNKK  60
KRYHRHTQHQ IQEMESFYKE CNHPDDKQRK ELGRRLGLEP LQVKFWFQNK RTQMKAQHER  120
CENTQLRNEN EKLRAENIRY KEALSNAACP NCGGPAAIGE MSFDEHQLRI ENARLRDEID  180
RITGIAGKYV GKSALGYSHQ LSLPQPEAPR VLDLAFGPQS GLLGEMYAAG DLLRTAVTGL  240
TDAEKPVVIE LAVSAMEELI RMAQTEEPLW LPSSGSETLC EQEYARIFPR GLGPKPATLN  300
SEASRESAVV IMNHINLVEI LMDVNQWTTV FAGLVSKAMT LEVLSTGVAG NHNGALQVMT  360
AEFQVPSPLV PTRENYFLRY CKQHGEGTWV VVDVSLDNLR TVSVPRCRRR PSGCLIQEMP  420
NGYSRVIWVE HVEVDENAVH DIYKPLVNSG IAFGAKRWVA TLDRQCERLA SVLALNIPTG  480
DVGIITSPAG RKSMLKLAER MVMSFCAGVG ASTTHIWTTL SGSGADDVRV MTRKSIDDPG  540
RPPGIVLSAA TSFWLPVSPK RVFDFLRDEN SRNEWDILSN GGIVQEMAHI ANGRDPGNCV  600
SLLRVNTGTN SNQSNMLILQ ESTTDVTGSY VIYAPVDIAA MNVVLGGGDP DYVALLPSGF  660
AILPDGPINY HGGGNSEIDS PGGSLLTVAF QILVDSVPTA KLSLGSVATV NSLIKCTVEK  720
IKGAVTSANA
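
The sequence above is printed in groups of ten residues with a running position counter at the end of each full line. As a rough sketch, assuming that the Start and End columns of the Signature Domain table are 1-based and inclusive (which agrees with the coordinates shown in the alignment blocks), the listing can be turned into a plain string and the domains sliced out as follows. The helper names `parse_blocked_sequence` and `slice_region` are hypothetical.

```python
import re

# Sequence listing copied from this record, including the position counters.
RAW = """
MFNSHQHLLD ISSSAQRTPD NELDFIRDEE FDSNSGADNM EAPNSGDDDQ ADPNQPPNKK  60
KRYHRHTQHQ IQEMESFYKE CNHPDDKQRK ELGRRLGLEP LQVKFWFQNK RTQMKAQHER  120
CENTQLRNEN EKLRAENIRY KEALSNAACP NCGGPAAIGE MSFDEHQLRI ENARLRDEID  180
RITGIAGKYV GKSALGYSHQ LSLPQPEAPR VLDLAFGPQS GLLGEMYAAG DLLRTAVTGL  240
TDAEKPVVIE LAVSAMEELI RMAQTEEPLW LPSSGSETLC EQEYARIFPR GLGPKPATLN  300
SEASRESAVV IMNHINLVEI LMDVNQWTTV FAGLVSKAMT LEVLSTGVAG NHNGALQVMT  360
AEFQVPSPLV PTRENYFLRY CKQHGEGTWV VVDVSLDNLR TVSVPRCRRR PSGCLIQEMP  420
NGYSRVIWVE HVEVDENAVH DIYKPLVNSG IAFGAKRWVA TLDRQCERLA SVLALNIPTG  480
DVGIITSPAG RKSMLKLAER MVMSFCAGVG ASTTHIWTTL SGSGADDVRV MTRKSIDDPG  540
RPPGIVLSAA TSFWLPVSPK RVFDFLRDEN SRNEWDILSN GGIVQEMAHI ANGRDPGNCV  600
SLLRVNTGTN SNQSNMLILQ ESTTDVTGSY VIYAPVDIAA MNVVLGGGDP DYVALLPSGF  660
AILPDGPINY HGGGNSEIDS PGGSLLTVAF QILVDSVPTA KLSLGSVATV NSLIKCTVEK  720
IKGAVTSANA
"""


def parse_blocked_sequence(text):
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def slice_region(seq, start, end):
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


seq = parse_blocked_sequence(RAW)
homeobox = slice_region(seq, 60, 115)       # Homeobox row of the domain table
start_domain = slice_region(seq, 250, 468)  # START row of the domain table

print(len(seq))
print(homeobox)
print(len(start_domain))
```

Checking the parsed length against the stated 730 aa is a cheap way to catch a truncated or duplicated line before using the coordinates.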
Functional Description
Source   Description
UniProt  Probable transcription factor that binds to the L1 box DNA sequence 5'-TAAATG[CT]A-3'. Plays a role in maintaining the identity of L1 cells, possibly by interacting with their L1 box or other target-gene promoters. Functionally redundant to ATML1. {ECO:0000269|PubMed:12505995}.
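
The L1 box notation 5'-TAAATG[CT]A-3' in the description is already a valid regular expression, so candidate sites can be located with a simple pattern search. The sketch below scans both strands of a DNA string. The promoter fragment is invented for illustration, and the function `find_l1_boxes` is a hypothetical helper, not a PlantTFDB or PlantRegMap tool. A match only marks a candidate site and says nothing about binding in vivo.

```python
import re

# L1 box as written in the description. The lookahead lets overlapping
# sites be reported, since the motif both starts and ends with "TA".
L1_BOX = re.compile(r"(?=(TAAATG[CT]A))")

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def find_l1_boxes(dna):
    """Return (strand, 1-based forward start, site) for every L1 box match."""
    dna = dna.upper()
    hits = [("+", m.start() + 1, m.group(1)) for m in L1_BOX.finditer(dna)]
    for m in L1_BOX.finditer(reverse_complement(dna)):
        site = m.group(1)
        # Map the reverse-strand match back to forward-strand coordinates.
        fwd_start = len(dna) - (m.start() + len(site)) + 1
        hits.append(("-", fwd_start, site))
    return sorted(hits, key=lambda hit: hit[1])


# Hypothetical promoter fragment, invented for this example only.
promoter = "GGCTAAATGCATTCCATGCATTTAGG"
for hit in find_l1_boxes(promoter):
    print(hit)
```

Reporting minus-strand hits in forward coordinates keeps both strands comparable against a single annotation of the promoter.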
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JF518780  0.0      JF518780.1 UNVERIFIED: Solanum lycopersicum GL2 protein-like mRNA, complete sequence.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_015066898.1      0.0      homeobox-leucine zipper protein PROTODERMAL FACTOR 2
Refseq     XP_027770690.1      0.0      homeobox-leucine zipper protein PROTODERMAL FACTOR 2
Swissprot  Q93V99              0.0      PDF2_ARATH; Homeobox-leucine zipper protein PROTODERMAL FACTOR 2
TrEMBL     A0A3Q7F4X2          0.0      A0A3Q7F4X2_SOLLC; Uncharacterized protein
STRING     Solyc02g080260.2.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA9322491
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G04890.1  0.0      protodermal factor 2
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]