PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PGSC0003DMP400052108
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family HD-ZIP
Protein Properties Length: 671aa    MW: 74950.1 Da    PI: 4.9069
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PGSC0003DMP400052108genomePGSCView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox59.17.2e-1956107253
                           T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHH CS
              Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRa 53 
                           +kR++++++q++eLe++F++n++p++++r eLA+k ++ ++qV++WFqN+R 
  PGSC0003DMP400052108  56 SKRHKYSDNQIQELEAVFKENSHPDEKTRLELATKFSVGKKQVQFWFQNKRS 107
                           69*************************************************7 PP

2START392.1e-13256361289
                           HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S... CS
                 START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla... 77 
                           la++a++el+k+a +++p+Wv+s     e++n +e+ ++f  s +      +++ea r sg v +++ +lve l++ + qW e ++   
  PGSC0003DMP400052108 256 LALAALNELLKLAMSDKPFWVRSLdgggEILNMEEYARSF-ISITgikpshFTTEATRSSGTVAGNSLTLVEMLMNES-QWVEVFPcii 342
                           7899************************999*99999999.55556999999**************************.********** PP

                           .EEEEEEEECTT......E CS
                 START  78 .kaetlevissg......g 89 
                            k+ t +vis+g      g
  PGSC0003DMP400052108 343 gKVNTFDVISTGigesksG 361
                           ***********97655554 PP

3START23.89.3e-09366407164205
                           EEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
                 START 164 vtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                           v+w+eh+++++  +h l+r+l++ gl +ga++w + +qrq e
  PGSC0003DMP400052108 366 VIWIEHMEYDEIFVHHLYRPLIRVGLGFGAQRWISSFQRQSE 407
                           8*************************************9965 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.602.0E-1740112IPR009057Homeodomain-like
PROSITE profilePS5007115.54852112IPR001356Homeobox domain
SuperFamilySSF466893.05E-1653112IPR009057Homeodomain-like
SMARTSM003894.7E-1454116IPR001356Homeobox domain
PfamPF000462.8E-1656107IPR001356Homeobox domain
CDDcd000861.08E-1557112No hitNo description
PROSITE profilePS5084815.145246365IPR002913START domain
PfamPF018521.4E-8256356IPR002913START domain
PfamPF018521.2E-4366406IPR002913START domain
SuperFamilySSF559612.2E-5432629No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 671 aa     Download sequence    Send to blast
MDDHNDTTET TERRECLVNL MIRGPREDQN EDTFVTNNRD GGASGDELNS PHGISSKRHK  60
YSDNQIQELE AVFKENSHPD EKTRLELATK FSVGKKQVQF WFQNKRSISK TQLERHDKKI  120
LQQENEKLCL EYAAMKEVME NSICDPCRNK DTIRKENIDE KEILNEHARL KNELARIAIH  180
ADNFLGPSTF LEGSLTSMMK NFGLELLTGR DETSDVNVVD GLSLNEVDFG KYLSSPPPTN  240
LVNKDLTLDK SMLLNLALAA LNELLKLAMS DKPFWVRSLD GGGEILNMEE YARSFISITG  300
IKPSHFTTEA TRSSGTVAGN SLTLVEMLMN ESQWVEVFPC IIGKVNTFDV ISTGIGESKS  360
GTLLLVIWIE HMEYDEIFVH HLYRPLIRVG LGFGAQRWIS SFQRQSEFLR VMASFVDSTV  420
DSKGEIGMGI LAQRMTRNFC AGICATSHKW KTIQIENEKD ANLMMRKNIS DPGEPIGVVL  480
SATKTIQLPI KPQCLFEFFT NNNMRIQWDI LSCSGPMKNI IHITKGQNLE SCVSLLCANG  540
DDIIANQNNM LIFQDTCTDA TGSLLVYAIV DSSQMNIVMK GGDSSCVELL PDGISIVPDL  600
SQDYYANNND GGNNNEFCSG SLVTIMFQMF VGNLSTTDLL EKSIIDANDI ISHTIHIRSK  660
LLSNASDSSR *
Cis-element ? help Back to Top
SourceLink
PlantRegMapPGSC0003DMP400052108
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9755181e-111HG975518.1 Solanum lycopersicum chromosome ch06, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
TrEMBLM1CXK40.0M1CXK4_SOLTU; Uncharacterized protein
STRINGPGSC0003DMT4000769020.0(Solanum tuberosum)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.11e-79HD-ZIP family protein
Publications ? help Back to Top
  1. Xu X, et al.
    Genome sequence and analysis of the tuber crop potato.
    Nature, 2011. 475(7355): p. 189-95
    [PMID:21743474]