PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PGSC0003DMP400029101
Organism Solanum tuberosum
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family HD-ZIP
Protein Properties Length: 499 aa    MW: 55730.4 Da    pI: 5.4063
Description HD-ZIP family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
PGSC0003DMP400029101  genome  PGSC    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  47.2   3.9e-15  53     107  2          56
                           T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           +k  +++++q+++L ++F+kn+ p+++   eLA+kl+++ +qV++WFqN+R+++k
  PGSC0003DMP400029101  53 KKPIRHNSDQIQALGSFFKKNSLPNKKVQLELATKLSMDINQVQNWFQNKRTQMK 107
                           55667899********************************************998 PP

2    START     111.3  1.4e-35  191    371  2          163
                           HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S.... CS
                 START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla.... 77 
                           la +a++el+++ + ++p+Wv+s     e +n +e+ + f++  +     +++ea +asg+v +++  lv++l+d++ qW e+++    
  PGSC0003DMP400029101 191 LAMDALNELFRLYENDKPLWVSSLdgggETLNIEEYARLFTPLIGtkpehFTTEATKASGIVADTSLALVNTLMDKR-QWVEMFPcivg 278
                           577899********************9999*********999888999*****************************.*********** PP

                           EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECT CS
                 START  78 kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksn 159
                           k  t++vis+g      g l l+++elq +s lv  R++ f+R+++++ +g+w+ivdvSvd  q+ +++s++  +++lpSg+++++++n
  PGSC0003DMP400029101 279 KTYTTDVISTGiggnksGSLLLIKTELQIISDLVYvREVQFLRFCKKHAEGVWAIVDVSVDTIQEGSQQSEIENCRRLPSGCILQDMPN 367
                           ***************************************************************************************** PP

                           CEEE CS
                 START 160 ghsk 163
                           g+++
  PGSC0003DMP400029101 368 GYCQ 371
                           ***7 PP
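
The Homeobox alignment above can be summarised as a simple identity count. The sketch below is illustrative only: it copies the model and query lines for domain 1 and counts the positions where both carry the same residue, ignoring case. The function name `count_identities` is hypothetical, and treating case-insensitive equality as an identity is an assumption about the alignment convention, which the page itself does not spell out.

```python
# Model (HMM consensus) and query lines for domain 1, copied from the alignment above.
MODEL = "rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek"
QUERY = "KKPIRHNSDQIQALGSFFKKNSLPNKKVQLELATKLSMDINQVQNWFQNKRTQMK"

GAP_CHARS = {".", "-"}


def count_identities(model: str, query: str) -> int:
    """Count aligned columns where model and query hold the same residue.

    Hypothetical helper: comparison is case-insensitive and gap columns
    are never counted as identities.
    """
    if len(model) != len(query):
        raise ValueError("alignment lines must have the same length")
    return sum(
        1
        for m, q in zip(model, query)
        if m not in GAP_CHARS and q not in GAP_CHARS and m.upper() == q.upper()
    )


identical = count_identities(MODEL, QUERY)
aligned = len(QUERY)
print(f"{identical}/{aligned} identical columns ({identical / aligned:.1%})")
```

The same helper would work on the START alignment if its three blocks were concatenated first, since gap columns are skipped rather than counted.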

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  5.6E-15   44     107  IPR009057    Homeodomain-like
SuperFamily      SSF46689          2.97E-15  48     112  IPR009057    Homeodomain-like
PROSITE profile  PS50071           14.9      49     109  IPR001356    Homeobox domain
SMART            SM00389           2.7E-12   51     113  IPR001356    Homeobox domain
CDD              cd00086           1.05E-12  52     109  No hit       No description
Pfam             PF00046           9.3E-13   53     107  IPR001356    Homeobox domain
PROSITE pattern  PS00027           0         84     107  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           25.069    181    371  IPR002913    START domain
SuperFamily      SSF55961          1.59E-18  184    371  No hit       No description
CDD              cd08875           2.77E-75  185    371  No hit       No description
SMART            SM00234           9.0E-17   190    417  IPR002913    START domain
Pfam             PF01852           6.0E-29   192    371  IPR002913    START domain
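
The hits in this table cluster into two regions of the protein. As a rough illustration, the sketch below merges overlapping start/end ranges into consolidated regions. The coordinates are as read from the table above; the names `FEATURES` and `merge_regions` are hypothetical and not part of PlantTFDB.

```python
from typing import List, Tuple

# (database, start, end) as read from the Protein Features table above.
FEATURES = [
    ("Gene3D", 44, 107),
    ("SuperFamily", 48, 112),
    ("PROSITE profile", 49, 109),
    ("SMART", 51, 113),
    ("CDD", 52, 109),
    ("Pfam", 53, 107),
    ("PROSITE pattern", 84, 107),
    ("PROSITE profile", 181, 371),
    ("SuperFamily", 184, 371),
    ("CDD", 185, 371),
    ("SMART", 190, 417),
    ("Pfam", 192, 371),
]


def merge_regions(spans: List[Tuple[int, int]]) -> List[Tuple[int, int]]:
    """Merge overlapping 1-based inclusive ranges into non-overlapping regions."""
    merged: List[Tuple[int, int]] = []
    for start, end in sorted(spans):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


regions = merge_regions([(start, end) for _, start, end in FEATURES])
for start, end in regions:
    print(f"region {start}-{end} ({end - start + 1} aa)")
```

With these inputs the hits collapse into one region covering the homeobox-related entries and one covering the START-related entries.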
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 499 aa
MEGHSDMNER RESLINVVIG GSGEEEIERS SIGDNIIGGA SVDERCESSS KRKKPIRHNS  60
DQIQALGSFF KKNSLPNKKV QLELATKLSM DINQVQNWFQ NKRTQMKSQL KLYENKTLKQ  120
ENDKFRIEHI VMKEALENSI RYNCKEKIDE NQRKIEHDQL EDEVKRLTAQ AYKLSLFNKD  180
VVHDKMVLLN LAMDALNELF RLYENDKPLW VSSLDGGGET LNIEEYARLF TPLIGTKPEH  240
FTTEATKASG IVADTSLALV NTLMDKRQWV EMFPCIVGKT YTTDVISTGI GGNKSGSLLL  300
IKTELQIISD LVYVREVQFL RFCKKHAEGV WAIVDVSVDT IQEGSQQSEI ENCRRLPSGC  360
ILQDMPNGYC QGDDSGANRV ILQDTCMDAT GSLLVYATID SQEINTVMKG GDSSCVTLFS  420
NGITIVPDYF QDFSTTNNYN VISGEMNNGF GGGSLVTISF QMVGNIFPAT TLSMELVKEA  480
NALISHTIHK IKSALKCK*
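
The Start and End columns of the Signature Domain table index into this sequence. A minimal sketch, assuming those coordinates are 1-based and inclusive, shows how the Homeobox segment (53 to 107) could be pulled out of the numbered block. Only the first two lines of the block are embedded, which is enough to cover that range; `parse_numbered_block` and `slice_domain` are hypothetical helper names.

```python
import re

# First two lines of the numbered sequence block above (residues 1-120),
# copied verbatim including the trailing position counters.
SEQUENCE_BLOCK = """\
MEGHSDMNER RESLINVVIG GSGEEEIERS SIGDNIIGGA SVDERCESSS KRKKPIRHNS  60
DQIQALGSFF KKNSLPNKKV QLELATKLSM DINQVQNWFQ NKRTQMKSQL KLYENKTLKQ  120
"""


def parse_numbered_block(block: str) -> str:
    """Strip spaces, position counters, and any stop marker ('*')."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"range {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


protein = parse_numbered_block(SEQUENCE_BLOCK)
homeobox = slice_domain(protein, 53, 107)
print(homeobox)
```

The extracted segment can be compared against the query line of the Homeobox alignment as a sanity check on the coordinate convention.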
Cis-element
Source       Link
PlantRegMap  PGSC0003DMP400029101
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975445  1e-46    HG975445.1 Solanum pennellii chromosome ch06, complete genome.
Annotation -- Protein
Source  Hit ID                E-value  Description
Refseq  XP_015169582.1        0.0      PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like
TrEMBL  M1BDS4                0.0      M1BDS4_SOLTU; Uncharacterized protein
STRING  PGSC0003DMT400042908  0.0      (Solanum tuberosum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA1581346
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G00730.2  7e-94    HD-ZIP family protein
Publications
  1. Xu X, et al.
    Genome sequence and analysis of the tuber crop potato.
    Nature, 2011. 475(7355): p. 189-95
    [PMID:21743474]