PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PGSC0003DMP400013651
Common Name LOC102604504
Organism Solanum tuberosum
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family HD-ZIP
Protein Properties Length: 810 aa    MW: 89496.9 Da    pI: 6.6078
Description HD-ZIP family protein
Gene Model
Gene Model ID           Type      Source    Coding Sequence
PGSC0003DMP400013651    genome    PGSC      View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      Homeobox    62.7     5.5e-20    123      178    1            56
                           TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           r++ +++t++q+++Le lF+++++p++++r eL+++l L++rqVk+WFqNrR+++k
  PGSC0003DMP400013651 123 RKRYHRHTPQQIQQLELLFKECPHPDEKQRMELSRRLCLETRQVKFWFQNRRTQMK 178
                           789999***********************************************999 PP

2      START       199.2    1.8e-62    312      539    1            206
                           HHHHHHHHHHHHHHHHC-TT-EEEE.....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S.. CS
                 START   1 elaeeaaqelvkkalaeepgWvkss.....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla.. 77 
                           ela++a++el+k+a+ ++p+W + +     e +n de+++kf++  +     + +e +r++g+v+ ++  lve+l+d++ +W e+++  
  PGSC0003DMP400013651 312 ELALAAMDELIKMAKTDDPLWLRNRelcggEVLNHDEYMRKFTPCIGlkpikFVSEGSRETGMVIINSLALVETLMDSN-KWAEMFPcl 399
                           5899****************************************999****99**************************.********* PP

                           ..EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--...-TTSEE-EESSEEEEEE CS
                 START  78 ..kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe..sssvvRaellpSgilie 155
                             + +t++vissg      galqlm +elq+lsplvp R+f f+R+++q+ +g+w++vdvSvd  ++ ++     +  +++lpSg++++
  PGSC0003DMP400013651 400 iaSTSTIDVISSGvggtrnGALQLMRSELQVLSPLVPiREFKFLRFCKQHAEGVWAVVDVSVDTIRETTTldATTFSNCRRLPSGCVVQ 488
                           *******************************************************************99999999************** PP

                           EECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                 START 156 pksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                           +++ng+sk+twvehv+++++++h+l+r+l++ g+ +ga++wvatlqrqce+
  PGSC0003DMP400013651 489 DMPNGYSKITWVEHVEYDESVVHQLYRPLISAGMGFGAQKWVATLQRQCEC 539
                           *************************************************96 PP

Protein Features
3D Structure
Database           Entry ID            E-value      Start    End    InterPro ID    Description
Gene3D             G3DSA:1.10.10.60    4.1E-21      108      174    IPR009057      Homeodomain-like
SuperFamily        SSF46689            5.56E-20     110      180    IPR009057      Homeodomain-like
PROSITE profile    PS50071             17.443       120      180    IPR001356      Homeobox domain
SMART              SM00389             1.6E-15      121      184    IPR001356      Homeobox domain
CDD                cd00086             2.02E-17     122      180    No hit         No description
Pfam               PF00046             1.9E-17      123      178    IPR001356      Homeobox domain
PROSITE pattern    PS00027             0            155      178    IPR017970      Homeobox, conserved site
PROSITE profile    PS50848             44.206       303      542    IPR002913      START domain
SuperFamily        SSF55961            7.42E-31     306      539    No hit         No description
CDD                cd08875             3.97E-117    307      538    No hit         No description
SMART              SM00234             1.2E-47      312      539    IPR002913      START domain
Pfam               PF01852             1.5E-54      312      539    IPR002913      START domain
SuperFamily        SSF55961            1.28E-22     558      792    No hit         No description
Gene Ontology
GO Term       GO Category           GO Description
GO:0006355    Biological Process    regulation of transcription, DNA-templated
GO:0005634    Cellular Component    nucleus
GO:0008289    Molecular Function    lipid binding
GO:0043565    Molecular Function    sequence-specific DNA binding
Sequence
Protein Sequence    Length: 810 aa
MNFGDFLDNS IIGDGNGGGA RIVTNSNNMR SNNNMSISTN LPSLAKSMFN SPRLSLALQT  60
GREGGPGGVA VMAEENYYEA NDNNNNIIGR RSIEKEQAES RSGSENLEGA SGDDEDDKPQ  120
RKRKRYHRHT PQQIQQLELL FKECPHPDEK QRMELSRRLC LETRQVKFWF QNRRTQMKTQ  180
LERHENSFLR QENDKLRAEN MSIREAIMNP ICTTCSGPAI IGEVSFEEQH LRIENSRLKD  240
ELDRVNALAG KFIGRPISLP LPNSTLELEV GNNGFRAKPD FGVGISNPLP VLPHTRQTTG  300
IEMSFDRSVY LELALAAMDE LIKMAKTDDP LWLRNRELCG GEVLNHDEYM RKFTPCIGLK  360
PIKFVSEGSR ETGMVIINSL ALVETLMDSN KWAEMFPCLI ASTSTIDVIS SGVGGTRNGA  420
LQLMRSELQV LSPLVPIREF KFLRFCKQHA EGVWAVVDVS VDTIRETTTL DATTFSNCRR  480
LPSGCVVQDM PNGYSKITWV EHVEYDESVV HQLYRPLISA GMGFGAQKWV ATLQRQCECL  540
AILMSSTVPS RDHTALTPSG RRSMLKLAQR MTNNFCSGVC ASSIHKWNKL NCVGNNVEDY  600
VRVLTRKSVD DPGEPPGIVV NAATSVWLPV SPQRLFEFLR DEQLRSEWDI LSNGGPMQEM  660
AHIAKGQDHG NCVSLLRASV MNASQNMLIL QETCTDASGS LVVYAPVDIP SMHLVMNGGD  720
SAYVALLPSG FSIVPDGPGS RGPNLVKSLN NGPGPGPDMR VSGSLLTVAF QILVNSLPTA  780
KLTVESVETV NNLISCTLQK IKGALHCES*
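
The Start and End values in the Signature Domain table can be checked against this sequence block. The sketch below is illustrative only: it assumes the domain coordinates are 1-based and inclusive, which is consistent with the Homeobox alignment shown above (123 to 178). The helper names `clean_sequence` and `slice_domain` are hypothetical and are not part of PlantTFDB. Only the first three lines of the sequence (residues 1 to 180) are included to keep the example short.

```python
import re

# First three lines of the protein sequence block (residues 1-180),
# copied as displayed, including spacing and the running position counters.
RAW = """
MNFGDFLDNS IIGDGNGGGA RIVTNSNNMR SNNNMSISTN LPSLAKSMFN SPRLSLALQT  60
GREGGPGGVA VMAEENYYEA NDNNNNIIGR RSIEKEQAES RSGSENLEGA SGDDEDDKPQ  120
RKRKRYHRHT PQQIQQLELL FKECPHPDEK QRMELSRRLC LETRQVKFWF QNRRTQMKTQ  180
"""


def clean_sequence(raw: str) -> str:
    """Remove whitespace, position counters, and any '*' stop marker."""
    return re.sub(r"[\s\d*]", "", raw)


def slice_domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
homeobox = slice_domain(seq, 123, 178)
print(len(homeobox), homeobox)
```

Under that assumption the slice is 56 residues long and matches the query line of the Homeobox alignment, residue for residue.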
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      116      122    DKPQRKR
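
The NLS coordinates can be compared with the protein sequence in the same way. The sketch below is a rough check, not a statement of how PlantTFDB computes these values: it locates the motif in a fragment of the sequence (residues 61 to 130, spacing removed) and reports 0-based inclusive positions. The `locate` helper and the fragment offset are hypothetical names introduced for illustration.

```python
# Residues 61-130 of the protein sequence in this record, spacing removed.
FRAGMENT = (
    "GREGGPGGVAVMAEENYYEANDNNNNIIGRRSIEKEQAESRSGSENLEGASGDDEDDKPQ"
    "RKRKRYHRHT"
)
FRAGMENT_OFFSET = 60  # number of residues that precede the fragment


def locate(motif: str, fragment: str, offset: int) -> tuple[int, int]:
    """Return (start, end) of motif as 0-based inclusive protein positions."""
    idx = fragment.find(motif)
    if idx < 0:
        raise ValueError("motif not found in fragment")
    start = offset + idx
    return start, start + len(motif) - 1


nls_start, nls_end = locate("DKPQRKR", FRAGMENT, FRAGMENT_OFFSET)
print(nls_start, nls_end)  # prints: 116 122
```

The result agrees with the listed Start and End only when positions are counted from 0. Counted from 1, as the Signature Domain coordinates appear to be, the same motif spans residues 117 to 123.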
Functional Description
Source     Description
UniProt    Probable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Cis-element
Source         Link
PlantRegMap    PGSC0003DMP400013651
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Protein
Source       Hit ID                  E-value    Description
Refseq       XP_006366174.1          0.0        PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like
Swissprot    Q0WV12                  0.0        ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBL       M1ACV5                  0.0        M1ACV5_SOLTU; Uncharacterized protein
STRING       PGSC0003DMT400019986    0.0        (Solanum tuberosum)
Orthologous Group
Lineage     Orthologous Group ID    Taxa Number    Gene Number
Asterids    OGEA16262465
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT4G00730.1    0.0        HD-ZIP family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Xu X, et al.
    Genome sequence and analysis of the tuber crop potato.
    Nature, 2011. 475(7355): p. 189-95
    [PMID:21743474]