PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sme2.5_00207.1_g00003.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family HD-ZIP
Protein Properties Length: 613aa    MW: 67709.5 Da    PI: 4.6178
Description HD-ZIP family protein
Gene Model
Gene Model ID              Type      Source    Coding Sequence
Sme2.5_00207.1_g00003.1    genome    EGD       View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  43.9   4e-14    50     87   20         57
                             HHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
                 Homeobox 20 eknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57
                             ++++ p++++reeL+ klgL+ +qVk+WFqN+R k k+
  Sme2.5_00207.1_g00003.1 50 QECPQPDQKTREELSCKLGLESSQVKFWFQNKRSKTKN 87
                             6799*******************************995 PP

2    START     125    9.3e-40  162    360  5          206
                              HHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHH.HHHHHHCCCGGCT-TT-SEEEEEEEE CS
                    START   5 eaaqelvkkalaeepgWvkss....esengdevlqkfeeskvdsgealrasgvvdmvla.llveellddkeqWdetlakaetlevi 85 
                               a++el+++a+ +ep+W++s     + +n +e+ ++f+++   ++ +  ++++v  ++a +  +++      +   +  a t++v+
  Sme2.5_00207.1_g00003.1 162 GAMEELLQLAEMGEPLWFSSIdgvnDLLNIEEYNRRFARG--NESMPNGIKTAVSRETAlNHWASF------FAYIVLTACTMNVL 239
                              589******************999999**********886..468888888888888882333333......3334669******* PP

                              CTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEE CS
                    START  86 ssg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskv 164
                              s+g      g  + ++ae+q++sp vp Rd +fvR++++   g w+ivdvS+d+   p+      R++++pSg++i++ sn +skv
  Sme2.5_00207.1_g00003.1 240 SNGvpgnldGSMEMIYAEFQVPSPQVPnRDCYFVRFCKRIANGLWAIVDVSLDNA--PT-----TRCRKRPSGCIIQQISNSYSKV 318
                              *************************************************999864..54.....5********************* PP

                              EEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                    START 165 twvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                              tw+eh++ ++ ++ ++++ +++s+la+gak+w   l++qce+
  Sme2.5_00207.1_g00003.1 319 TWIEHIEADDTSVNSIYKAFLNSSLAFGAKRWIGILNKQCER 360
                              ****************************************97 PP

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
SMART            SM00389            3.6E-6    34     92   IPR001356    Homeobox domain
Pfam             PF00046            1.3E-11   50     86   IPR001356    Homeobox domain
Gene3D           G3DSA:1.10.10.60   1.4E-13   50     96   IPR009057    Homeodomain-like
SuperFamily      SSF46689           8.58E-11  50     90   IPR009057    Homeodomain-like
CDD              cd00086            3.12E-11  50     88   No hit       No description
PROSITE profile  PS50071            12.406    53     88   IPR001356    Homeobox domain
PROSITE pattern  PS00027            0         63     86   IPR017970    Homeobox, conserved site
PROSITE profile  PS50848            27.078    149    363  IPR002913    START domain
SMART            SM00234            9.1E-22   158    360  IPR002913    START domain
CDD              cd08875            6.02E-82  161    359  No hit       No description
SuperFamily      SSF55961           3.02E-22  162    360  No hit       No description
Pfam             PF01852            1.3E-31   163    360  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  7.9E-6    246    356  IPR023393    START-like domain
SuperFamily      SSF55961           3.57E-12  384    581  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 613 aa
MDENISSANK NSSIIENDSP SAGLPLGQNS SLIENDPKFV PSSSELQVVQ ECPQPDQKTR  60
EELSCKLGLE SSQVKFWFQN KRSKTKNQDK RDENSSLRAE NEKLRAECLW LSEAINNGCP  120
SCGIPSFRLG ETANIEQYLR LENARLQEEV VRISRTYNVV MGAMEELLQL AEMGEPLWFS  180
SIDGVNDLLN IEEYNRRFAR GNESMPNGIK TAVSRETALN HWASFFAYIV LTACTMNVLS  240
NGVPGNLDGS MEMIYAEFQV PSPQVPNRDC YFVRFCKRIA NGLWAIVDVS LDNAPTTRCR  300
KRPSGCIIQQ ISNSYSKVTW IEHIEADDTS VNSIYKAFLN SSLAFGAKRW IGILNKQCER  360
LASAEAPNSF KNDIYYRRVS GTTTHRWTTL TESGYNVNGI QVMTTQSIND PGRPCGIVLS  420
SSISIWLPVP PNILFDFLRD ENIRGEWDVL SNGGSMKEVI HIANGRETGN CVSILRVNSS  480
NPGQSNLSII QESMSDPSGS FIVYAPIDIL EIEKVLCGGS PDDVPLLPSG FAILPDGPLG  540
VSSSGSLLTV SFQILAASVP TTDVSPQSVT AVENLMICTI DKINNALFSN LPSMLDNVKE  600
MEHSREEVDI GII
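
The domain coordinates reported in the Signature Domain table are 1-based and inclusive, so they can be checked directly against the formatted sequence. The following is a minimal sketch, assuming the sequence block has been copied as plain text; the helper names `clean_sequence` and `slice_domain` are hypothetical and not part of any PlantTFDB tool. It uses only the first two rows of the sequence (residues 1-120), which is enough to cover the Homeobox domain at positions 50-87.

```python
import re

def clean_sequence(block: str) -> str:
    """Drop spaces, line breaks, and position counters from a formatted block."""
    return re.sub(r"[^A-Za-z]", "", block).upper()

def slice_domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]

# First two rows of the protein sequence shown above (residues 1-120).
first_rows = """
MDENISSANK NSSIIENDSP SAGLPLGQNS SLIENDPKFV PSSSELQVVQ ECPQPDQKTR  60
EELSCKLGLE SSQVKFWFQN KRSKTKNQDK RDENSSLRAE NEKLRAECLW LSEAINNGCP  120
"""

seq = clean_sequence(first_rows)
homeobox = slice_domain(seq, 50, 87)  # coordinates from the Signature Domain table
print(len(seq), homeobox)
```

The extracted segment should match the query line of the Homeobox alignment in the Signature Domain section; the same slicing applies to the START domain (162-360) once the full 613-residue sequence is supplied.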
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID                E-value  Description
Refseq     XP_006342404.1        0.0      PREDICTED: homeobox-leucine zipper protein ROC7-like
Swissprot  A2YR02                1e-179   ROC7_ORYSI; Homeobox-leucine zipper protein ROC7
TrEMBL     M1BN38                0.0      M1BN38_SOLTU; Uncharacterized protein
STRING     PGSC0003DMT400048971  0.0      (Solanum tuberosum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA1721855
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G04890.1  1e-161   protodermal factor 2