PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sopim06g035940.0.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family HD-ZIP
Protein Properties Length: 677aa    MW: 75633.6 Da    PI: 5.5787
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sopim06g035940.0.1genomeCSHLView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox55.88e-183080353
                        --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHH CS
            Homeobox  3 kRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRa 53
                        +R++f+ +q++eLe++F+ +++p++++++eLA+k +++++qV++WFqN+R 
  Sopim06g035940.0.1 30 RRHKFSVNQIHELESVFKVSSHPDEKTKQELATKFSVDKKQVQFWFQNKRS 80
                        89************************************************7 PP

2START164.38.5e-522094312205
                         HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EE CS
               START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....ka 79 
                         la++a++el+k+a ++ep+Wv+      e++n +e+ ++f +  +     +++ea r sg v +++ +lve l++ + qW e ++    k+
  Sopim06g035940.0.1 209 LALDALNELLKLAMSDEPLWVRNLdgggEMLNMEEYATTFIPIIGikpshFTTEATRSSGTVVGNSLTLVEMLMNES-QWVEAFPciigKV 298
                         7899************************************88777999999**************************.************* PP

                         EEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEE CS
               START  80 etlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghsk 163
                          t +vis+g      g+l l+ aelq +s +vp R+  f+R+++++ +++w+ivdvSvd  ++ +++ ++ ++++lpSg++i+++sng+sk
  Sopim06g035940.0.1 299 NTFDVISTGigegksGTLLLIEAELQIISNVVPvREIKFLRFCQKHAEDSWIIVDVSVDTIKEGSQQYKIEKCRRLPSGCIIQDMSNGYSK 389
                         *****************************************************************9************************* PP

                         EEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
               START 164 vtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                         vtw+eh+++++  ++ l+r+l++ gl +ga++w++ lqrq e
  Sopim06g035940.0.1 390 VTWIEHMEYDEIFVDHLYRPLIRAGLGFGAQRWMSSLQRQSE 431
                         ***************************************976 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.609.9E-161879IPR009057Homeodomain-like
SuperFamilySSF466896.94E-151985IPR009057Homeodomain-like
PROSITE profilePS5007115.6132585IPR001356Homeobox domain
SMARTSM003896.6E-132789IPR001356Homeobox domain
PfamPF000463.7E-153080IPR001356Homeobox domain
CDDcd000862.96E-143086No hitNo description
PROSITE profilePS5084839.771199435IPR002913START domain
SuperFamilySSF559611.22E-25201430No hitNo description
SMARTSM002341.8E-35208432IPR002913START domain
PfamPF018526.6E-45209431IPR002913START domain
CDDcd088751.03E-96216431No hitNo description
SuperFamilySSF559612.2E-5458644No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 677 aa     Download sequence    Send to blast
MEKSTMELSN NRDGGVSGDE LTSPDGSSER RHKFSVNQIH ELESVFKVSS HPDEKTKQEL  60
ATKFSVDKKQ VQFWFQNKRS ISKTQSERYN KRVLQQENEK LRTEYAAMRE VMKKSICDPC  120
RNKDTTIRNE NVDEKEILNE HARLKDELAR IAIHADKSLG SSSFLEGSLT SMMEKFGLEL  180
NEVDFGKYLS SPLPTNLDVT LDKSMLLNLA LDALNELLKL AMSDEPLWVR NLDGGGEMLN  240
MEEYATTFIP IIGIKPSHFT TEATRSSGTV VGNSLTLVEM LMNESQWVEA FPCIIGKVNT  300
FDVISTGIGE GKSGTLLLIE AELQIISNVV PVREIKFLRF CQKHAEDSWI IVDVSVDTIK  360
EGSQQYKIEK CRRLPSGCII QDMSNGYSKV TWIEHMEYDE IFVDHLYRPL IRAGLGFGAQ  420
RWMSSLQRQS ELLRVMASFV NSTVDPKGEI GMGILSQRMT RSFCAVICAT SHKWITIQKE  480
NGKDANLMMR RNISDAGEPI GVILSATKTI QLPIKSQCLF QFFTNKNLRR QWDILSCSGA  540
MENIIHINKD ENLESSVSLL CANGGANENN MMIFQDTCTD ATGSLLVYAI VDSSKMNTVM  600
KGGDPSCVEL LPNGISILPD LSANNNKEFG SGSLVTIMFQ MLVDNISTAD LPQKSIVDAN  660
DIISHTIHKI KSALLI*
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9755180.0HG975518.1 Solanum lycopersicum chromosome ch06, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_004240735.10.0homeobox-leucine zipper protein HDG1 isoform X2
TrEMBLA0A3Q7HLS60.0A0A3Q7HLS6_SOLLC; Uncharacterized protein
STRINGSolyc06g035940.2.10.0(Solanum lycopersicum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA2092233
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.11e-177HD-ZIP family protein