PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sme2.5_05186.1_g00001.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family Trihelix
Protein Properties Length: 662 aa    MW: 73335.3 Da    pI: 7.4564
Description Trihelix family protein
Gene Model
Gene Model ID            Type    Source  Coding Sequence
Sme2.5_05186.1_g00001.1  genome  EGD     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  91.2   1.1e-28  70     153  1          86
                 trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                              rW++ e+laL+++r+em+  +++++lk+plWe+vs+km++ g++rs+k+Ckek+en+ k+++++keg+ ++   + +t+++fdql+
  Sme2.5_05186.1_g00001.1  70 RWPRPETLALLKIRSEMDVVFKDSSLKGPLWEQVSRKMADLGYHRSAKKCKEKFENVYKYHRRTKEGRASK--ADGKTYRFFDQLQ 153
                              8********************************************************************96..66678******98 PP

2    trihelix  99.3   3.3e-31  458    543  1          87
                 trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                              rW+k+e+ aLi++r+ ++ +++++  k+plWee+s  mr+ g++r++k+Ckekwen+nk++kk+ke++k+r +e+s+tcpyf+qle
  Sme2.5_05186.1_g00001.1 458 RWPKEEIEALIRLRTCLDLKYQDNGPKGPLWEEISVGMRKIGYNRNAKRCKEKWENINKYFKKVKESNKRR-PEDSKTCPYFNQLE 542
                              8********************************************************************97.99***********8 PP

                 trihelix  87 a 87 
                              a
  Sme2.5_05186.1_g00001.1 543 A 543
                              5 PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50090   6.818     63     127  IPR017877    Myb-like domain
SMART            SM00717   0.014     67     129  IPR001005    SANT/Myb domain
CDD              cd12203   1.13E-23  69     134  No hit       No description
Pfam             PF13837   1.1E-18   69     155  No hit       No description
SMART            SM00717   0.0034    455    517  IPR001005    SANT/Myb domain
CDD              cd12203   8.84E-27  457    522  No hit       No description
Pfam             PF13837   1.2E-20   457    544  No hit       No description
PROSITE profile  PS50090   7.282     457    515  IPR017877    Myb-like domain
Sequence
Protein Sequence    Length: 662 aa
MLGVSSGLIA STTTTAGDGM AAPPPIPTRP PQEAPESGGS SEGGGGGGGD MCVGGGGFGE  60
EGERNSGGNR WPRPETLALL KIRSEMDVVF KDSSLKGPLW EQVSRKMADL GYHRSAKKCK  120
EKFENVYKYH RRTKEGRASK ADGKTYRFFD QLQALENNPA SHSLPPPPLA ATPITMAMPM  180
RSGNASAIPP VMPAGQNHNH PFVVSQNTVV TAAAPAVSHL MTAPTLQPQP AVQPITNNLN  240
QMNRPQGNTT TSFLSTSTSS SSTSSDEDIQ RRHMKKRKMK EFFQSLMKDV IEKQEEMQKK  300
FLEMLEKRER DKLMREETWR EQEMARLNRE HDLLVQERSM AAAKDATIIA FLQKITEQQN  360
TRIPNSTNNT SPSPPPAQIQ LKLSEKPLST PVHSQPSSLS QSQSSRAIAV SLPMTIHTPA  420
PLQPQALSLS VVAAPKSTEP PPKTDNGGEN FSPPSSSRWP KEEIEALIRL RTCLDLKYQD  480
NGPKGPLWEE ISVGMRKIGY NRNAKRCKEK WENINKYFKK VKESNKRRPE DSKTCPYFNQ  540
LEALYKEKAK NEAVQHTAGF GLRPENNNPI APLPPTIMAQ PEQQWPLPQH HQPQLQPQQQ  600
NRDNHRDNES DDSMEHNEEE DNMEEDDEED EDEGGYEIVT KKQASSMAAA TTVSAAAATT  660
AV
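
The Start and End values in the Signature Domain table appear to be 1-based, inclusive positions in this protein sequence (the first trihelix hit starts at residue 70, which is the R in "RWPRPE"). A minimal Python sketch of how one might flatten the numbered sequence block and pull out a coordinate range is given here. The function names `parse_sequence_block` and `slice_1based` are hypothetical helpers, not part of PlantTFDB, and only the first two sequence lines are used to keep the example short.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a space-grouped, position-numbered sequence block into one string.

    Assumption: everything that is not a letter (spaces, newlines, and the
    running position counters at line ends) can be discarded.
    """
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    return seq[start - 1:end]


# First two lines of the protein sequence shown above (residues 1-120).
block = """
MLGVSSGLIA STTTTAGDGM AAPPPIPTRP PQEAPESGGS SEGGGGGGGD MCVGGGGFGE  60
EGERNSGGNR WPRPETLALL KIRSEMDVVF KDSSLKGPLW EQVSRKMADL GYHRSAKKCK  120
"""

seq = parse_sequence_block(block)
print(len(seq))                   # number of residues parsed
print(slice_1based(seq, 70, 79))  # first ten residues of trihelix domain 1
```

Applying the same slicing to the full 662-residue sequence with the Start and End values from the domain table should recover the aligned query segments shown in the Signature Domain section.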
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    269    277  QRRHMKKRK
Binding Motif
Motif ID  Method  Source                   Motif file
MP00244   DAP     Transfer from AT1G76890  Download
Motif logo
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  -                   Retrieve
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_010314480.1      0.0      trihelix transcription factor GT-2-like
TrEMBL  A0A3Q7JB43          0.0      A0A3Q7JB43_SOLLC; Uncharacterized protein
STRING  Solyc12g056510.1.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA2402              24           59
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76890.2  1e-103   Trihelix family protein