PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PGSC0003DMP400025166
Common Name LOC102579878
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family HD-ZIP
Protein Properties Length: 813aa    MW: 88078.6 Da    PI: 6.5057
Description HD-ZIP family protein
Gene Model
Gene Model ID           Type      Source    Coding Sequence
PGSC0003DMP400025166    genome    PGSC      View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  64.5   1.5e-20  119    173  2          56
                           T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           +k +++t++q++eLe++F++n++p++++r eL k+l L+ rqVk+WFqNrR+++k
  PGSC0003DMP400025166 119 KKYHRHTPYQIQELEACFKENPHPDEKARLELGKRLTLESRQVKFWFQNRRTQMK 173
                           78899***********************************************999 PP

2    START     156.2  2.6e-49  326    548  1          205
                           HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S... CS
                 START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla... 77 
                           ela + ++el+k+a+ ++p+W +      e++n +e+ ++f++  +     +s ea +a+g v m++  lve+l+d++ +W   +    
  PGSC0003DMP400025166 326 ELAFASMNELIKLADIGAPLWLRNFdgsaEELNLEEYARSFPPCIGrkpahFSAEATKATGTVMMNSLALVESLMDTS-RWMDIFSciv 413
                           578999*****************9999999************998899******************************.********** PP

                           .EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEEC CS
                 START  78 .kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepks 158
                            + +t++vis+       g l+l++ae+q+ls lvp R++ ++R+++q+ +g+wv+vdvS+d  q+ +  +    +++lpSg+++++++
  PGSC0003DMP400025166 414 gRTSTINVISNSsggskdGNLHLIQAEFQVLSALVPvRKVKYLRFCKQHAEGVWVVVDVSIDAIQEGS-IPLDGNCRRLPSGCIVQDLP 501
                           **********99999*****************************************************.58888999************ PP

                           TCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
                 START 159 nghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                           ng skv+w+eh+++++++ h+++++ ++sgl +ga++w atlqrqce
  PGSC0003DMP400025166 502 NGCSKVIWIEHTEYDESITHNYYHPYIRSGLGFGAQRWIATLQRQCE 548
                           **********************************************8 PP

Protein Features
3D Structure
Database         Entry ID          E-value    Start  End  InterPro ID  Description
SuperFamily      SSF46689          1.41E-19   104    175  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  8.5E-22    113    183  IPR009057    Homeodomain-like
PROSITE profile  PS50071           16.925     115    175  IPR001356    Homeobox domain
SMART            SM00389           1.8E-16    117    179  IPR001356    Homeobox domain
Pfam             PF00046           4.0E-18    119    173  IPR001356    Homeobox domain
CDD              cd00086           6.94E-17   119    176  No hit       No description
PROSITE profile  PS50848           36.904     317    552  IPR002913    START domain
SuperFamily      SSF55961          6.18E-26   320    548  No hit       No description
CDD              cd08875           7.41E-103  321    548  No hit       No description
SMART            SM00234           7.6E-38    326    549  IPR002913    START domain
Pfam             PF01852           5.7E-42    327    548  IPR002913    START domain
SuperFamily      SSF55961          3.98E-11   590    778  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 813 aa
MSFGGFIGSS SGGNGGSGVS RLVGDSSYEA MPTATMAQSQ LITSSLSHSM FNSSPLSLAL  60
KPKMEGAGDL SFDAAAVMGR NSRDDEYESR SGTGSDNLDG VGSGDEMETH IGSSSKSAKK  120
YHRHTPYQIQ ELEACFKENP HPDEKARLEL GKRLTLESRQ VKFWFQNRRT QMKTQMERHE  180
NSMLKQENDK LRIENIAMKD AMRSPACPHC GGQAILGEIH IEEHHLKIEN ARLRDEYNRI  240
CVVANKFLGR PSESFHGPMS AGMANSGLEL AVGRNGYGAM NSVDTALPMG LNFGNNFSSA  300
LPAISPRPTL SMAGVGVSCD KNMLMELAFA SMNELIKLAD IGAPLWLRNF DGSAEELNLE  360
EYARSFPPCI GRKPAHFSAE ATKATGTVMM NSLALVESLM DTSRWMDIFS CIVGRTSTIN  420
VISNSSGGSK DGNLHLIQAE FQVLSALVPV RKVKYLRFCK QHAEGVWVVV DVSIDAIQEG  480
SIPLDGNCRR LPSGCIVQDL PNGCSKVIWI EHTEYDESIT HNYYHPYIRS GLGFGAQRWI  540
ATLQRQCEFL AIMSSAVPSG DNSVVSSSGR RSIAVLARRV TRSFCVGVCA TYYDWESIQS  600
GTAEESKLIM RKGVGEPGDP NGMVLSASRS LWLPVTHQRL FDFLRNEQTR SQWDVLSQGG  660
SVHPIVHIGK GQDLGNSITL FRTSVANSDG SQNSMLTLQE SCTDVSGSII AYTSLNSGDM  720
NVVMSGGDSS CVTFLPSGFA IIPDCYENSN GVAAGNGILE NGGKINGCLL TMGFQILMTN  780
PPTGTLTMDS VNTVNSLITR TVQNIKLAFQ CN*
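
The sequence above is printed in groups of ten residues, with a running position counter at the end of each full line and a trailing `*` after the last residue. The following is a minimal Python sketch, assuming that layout, for joining such a block into a plain string and cutting out a region by the 1-based, inclusive Start/End coordinates used in the Signature Domain table. The function names `clean_sequence` and `domain_slice` are hypothetical helpers for illustration and are not part of PlantTFDB.

```python
import re


def clean_sequence(block: str) -> str:
    """Join a numbered, space-grouped sequence block into one string.

    Assumption: residues are the only letters in the block, so dropping
    every non-letter removes the spaces, the position counters, the line
    breaks and the trailing '*'.
    """
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return seq[start - 1:end]
```

Applied to the cleaned sequence of this entry, `domain_slice(seq, 119, 173)` should return the query segment shown in the Homeobox alignment, which begins with KKYHRHTPYQ.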
Functional Description
Source     Description
UniProt    Probable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Cis-element
Source         Link
PlantRegMap    PGSC0003DMP400025166
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    HG975442    0.0        HG975442.1 Solanum pennellii chromosome ch03, complete genome.
GenBank    HG975515    0.0        HG975515.1 Solanum lycopersicum chromosome ch03, complete genome.
Annotation -- Protein
Source       Hit ID                  E-value    Description
Refseq       XP_006353342.1          0.0        PREDICTED: homeobox-leucine zipper protein ROC5
Swissprot    Q0WV12                  0.0        ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBL       M1B4R1                  0.0        M1B4R1_SOLTU; Uncharacterized protein
STRING       PGSC0003DMT400037055    0.0        (Solanum tuberosum)
Orthologous Group
Lineage     Orthologous Group ID    Taxa Number    Gene Number
Asterids    OGEA1626                24             65
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT4G00730.1    0.0        HD-ZIP family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Xu X, et al.
    Genome sequence and analysis of the tuber crop potato.
    Nature, 2011. 475(7355): p. 189-95
    [PMID:21743474]