PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sopen04g027330.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family Trihelix
Protein Properties Length: 652aa    MW: 71831.2 Da    PI: 6.6261
Description Trihelix family protein
Gene Model
Gene Model ID      Type    Source  Coding Sequence
Sopen04g027330.1   genome  spenn   View CDS
Signature Domain
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  93.1   2.7e-29  58     142  1          87
          trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                       rW++qe+ aL+++r+em+  +r+++lk+plWeevs+km++ gf+rs+k+Ckek+en+ k++k++k+g+ ++   + +++++f+qlea
  Sopen04g027330.1  58 RWPRQETIALLKIRSEMDVIFRDSSLKGPLWEEVSRKMADLGFHRSSKKCKEKFENVYKYHKRTKDGRASK--ADGKNYRFFEQLEA 142
                       8********************************************************************95..67778*******85 PP

2    trihelix  103.7  1.3e-32  459    544  1          87
          trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                       rW+k ev aLi++r++++ +++++  k+plWee+s+ m++ g++r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ql+a
  Sopen04g027330.1 459 RWPKAEVEALIKLRTNLDVKYQENGPKGPLWEEISSGMKKIGYNRNAKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFHQLDA 544
                       8*********************************************************************8.99***********85 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50090           7.027     51     115  IPR017877    Myb-like domain
SMART            SM00717           3.4E-4    55     117  IPR001005    SANT/Myb domain
CDD              cd12203           1.66E-25  57     122  No hit       No description
Pfam             PF13837           8.9E-19   57     143  No hit       No description
PRINTS           PR01217           3.5E-9    153    165  No hit       No description
PRINTS           PR01217           3.5E-9    192    204  No hit       No description
PRINTS           PR01217           3.5E-9    215    236  No hit       No description
PRINTS           PR01217           3.5E-9    381    406  No hit       No description
PROSITE profile  PS50090           7.457     452    516  IPR017877    Myb-like domain
SMART            SM00717           4.6E-4    456    518  IPR001005    SANT/Myb domain
CDD              cd12203           3.39E-28  458    523  No hit       No description
Gene3D           G3DSA:1.10.10.60  1.7E-4    458    515  IPR009057    Homeodomain-like
Pfam             PF13837           2.3E-22   458    545  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 652 aa
MMLGVSGLVS SEGGGDNPES GGGAGSGGSS EIGLGGGSGG GGGGFMTEDG ERNSGGNRWP  60
RQETIALLKI RSEMDVIFRD SSLKGPLWEE VSRKMADLGF HRSSKKCKEK FENVYKYHKR  120
TKDGRASKAD GKNYRFFEQL EALENITSHH SLMPPSSNTR PPPPPLEATP INMAMPMASS  180
NVQVTASQGT IPHHVTISSA PPPPNSLFAP SHQNAPSSSP VPLPPPPSQQ PSPQPAVNPI  240
NNIPQQVNAS AMSYSTSSST SSDEDIQRRH KKKRKWKDYF EKFTKDVINK QEESHRRFLE  300
KLEKREHDRM VREEAWKVQE MARMNREHDL LVQERAMAAA KDAAVISFLQ KITEQQNIQI  360
PNSINVGPPS AQVQIQLPEN PLSAPVPTQI QPTTVTAAPP QPAPAPVPVS LPVTIPAPVP  420
ALIPSLSLPL TPPVPSKNME LVPKSDNGGD SYSPASSSRW PKAEVEALIK LRTNLDVKYQ  480
ENGPKGPLWE EISSGMKKIG YNRNAKRCKE KWENINKYFK KVKESNKKRP EDSKTCPYFH  540
QLDALYKEKA KNPETASLTS SFNPSFALNP DNNQMAPIMA RPEQQWPLPQ HHESTTRIDH  600
ENESDNMDED DHDDDDDEDD EDENNAYEIV ANKQQSSMAA ANTTTSTATT TV
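The Start and End columns of the Signature Domain table can be checked against the sequence block above. The following is a minimal sketch, not part of the database record: the helper names `clean_sequence` and `domain` are hypothetical, and it assumes the domain coordinates are 1-based and inclusive, which is consistent with the residue numbers printed in the two alignments.

```python
import re

# Sequence block copied from this record. The spacing and the position
# counters at the end of each line are left in so the parser has to strip them.
RAW = """
MMLGVSGLVS SEGGGDNPES GGGAGSGGSS EIGLGGGSGG GGGGFMTEDG ERNSGGNRWP  60
RQETIALLKI RSEMDVIFRD SSLKGPLWEE VSRKMADLGF HRSSKKCKEK FENVYKYHKR  120
TKDGRASKAD GKNYRFFEQL EALENITSHH SLMPPSSNTR PPPPPLEATP INMAMPMASS  180
NVQVTASQGT IPHHVTISSA PPPPNSLFAP SHQNAPSSSP VPLPPPPSQQ PSPQPAVNPI  240
NNIPQQVNAS AMSYSTSSST SSDEDIQRRH KKKRKWKDYF EKFTKDVINK QEESHRRFLE  300
KLEKREHDRM VREEAWKVQE MARMNREHDL LVQERAMAAA KDAAVISFLQ KITEQQNIQI  360
PNSINVGPPS AQVQIQLPEN PLSAPVPTQI QPTTVTAAPP QPAPAPVPVS LPVTIPAPVP  420
ALIPSLSLPL TPPVPSKNME LVPKSDNGGD SYSPASSSRW PKAEVEALIK LRTNLDVKYQ  480
ENGPKGPLWE EISSGMKKIG YNRNAKRCKE KWENINKYFK KVKESNKKRP EDSKTCPYFH  540
QLDALYKEKA KNPETASLTS SFNPSFALNP DNNQMAPIMA RPEQQWPLPQ HHESTTRIDH  600
ENESDNMDED DHDDDDDEDD EDENNAYEIV ANKQQSSMAA ANTTTSTATT TV
"""


def clean_sequence(raw: str) -> str:
    """Remove whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive (assumed)."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
dom1 = domain(seq, 58, 142)   # first trihelix hit in the Signature Domain table
dom2 = domain(seq, 459, 544)  # second trihelix hit

print("protein length:", len(seq))
print("domain 1:", dom1)
print("domain 2:", dom2)
```

The extracted segments should match the ungapped `Sopen04g027330.1` rows of the two trihelix alignments; if they do not, the coordinate convention assumed here is wrong for that table.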
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    266    273  QRRHKKKR
2    266    274  QRRHKKKRK
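The record does not state whether the NLS Start and End values are 0-based or 1-based. One way to check is to locate each motif in the protein sequence and print both conventions side by side. The sketch below is illustrative only: `locate` is a hypothetical helper, and `PREFIX` holds just the first 300 residues of the sequence listed in this record, which is enough to contain both motifs.

```python
# First 300 residues of the protein sequence from this record.
PREFIX = (
    "MMLGVSGLVSSEGGGDNPESGGGAGSGGSSEIGLGGGSGGGGGGFMTEDGERNSGGNRWP"
    "RQETIALLKIRSEMDVIFRDSSLKGPLWEEVSRKMADLGFHRSSKKCKEKFENVYKYHKR"
    "TKDGRASKADGKNYRFFEQLEALENITSHHSLMPPSSNTRPPPPPLEATPINMAMPMASS"
    "NVQVTASQGTIPHHVTISSAPPPPNSLFAPSHQNAPSSSPVPLPPPPSQQPSPQPAVNPI"
    "NNIPQQVNASAMSYSTSSSTSSDEDIQRRHKKKRKWKDYFEKFTKDVINKQEESHRRFLE"
)


def locate(seq: str, motif: str):
    """Return inclusive 0-based and 1-based bounds of the first match, or None."""
    i = seq.find(motif)  # str.find gives a 0-based index, or -1 if absent
    if i < 0:
        return None
    last = i + len(motif) - 1
    return {"start0": i, "end0": last, "start1": i + 1, "end1": last + 1}


for motif in ("QRRHKKKR", "QRRHKKKRK"):
    print(motif, locate(PREFIX, motif))
```

Comparing the printed bounds with the NLS table shows which convention the Start and End columns follow, and whether it is the same one used in the Signature Domain table.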
Binding Motif
Motif ID  Method  Source                   Motif file
MP00244   DAP     Transfer from AT1G76890  Download
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975443  0.0      HG975443.1 Solanum pennellii chromosome ch04, complete genome.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_015073652.1      0.0      trihelix transcription factor GT-2-like
TrEMBL  M1C8A3              0.0      M1C8A3_SOLTU; Uncharacterized protein
STRING  Solyc04g071360.2.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA2402              24           59
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76890.2  2e-77    Trihelix family protein