PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Peaxi162Scf01294g00224.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Petunioideae; Petunia
Family Trihelix
Protein Properties Length: 1220aa    MW: 135931 Da    PI: 6.9654
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Peaxi162Scf01294g00224.1genomeSGNView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix90.91.3e-2868151186
                  trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdql 85 
                               rW++qe+laL+++r+em+  +r+++lk+plWeevs+k+++ g++rs+k+Ckek+en+ k+++++k+g+ +++  + +t+++f+ l
  Peaxi162Scf01294g00224.1  68 RWPRQETLALLRIRSEMDVVFRDSSLKGPLWEEVSRKLADLGYHRSAKKCKEKFENVYKYHRRTKDGRASKS--DGKTYRFFEAL 150
                               8********************************************************************974..5567*****99 PP

                  trihelix  86 e 86 
                               e
  Peaxi162Scf01294g00224.1 151 E 151
                               8 PP

2trihelix1019.7e-32466551187
                  trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdql 85 
                               rW+k+e+ aLi++r+ ++ +++++  k+plWee+s+ mr  g++r++k+Ckekwen+nk+ykk+ke++kkr +e+s+tcpyf+ql
  Peaxi162Scf01294g00224.1 466 RWPKEEIEALIRLRTCLDLKYQENVPKGPLWEEISAGMRNLGYNRNAKRCKEKWENINKYYKKVKESNKKR-PEDSKTCPYFHQL 549
                               8*********************************************************************8.99*********** PP

                  trihelix  86 ea 87 
                               ea
  Peaxi162Scf01294g00224.1 550 EA 551
                               85 PP

3trihelix38.33.5e-126657123786
                  trihelix  37 kmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                               k+++ g++rs+k+Ckek+en+ k+++++k+g+ +++  + +t+++f+ le
  Peaxi162Scf01294g00224.1 665 KLADLGYHRSAKKCKEKFENVYKYHRRTKDGRASKS--DGKTYRFFEALE 712
                               67899****************************974..5567*****998 PP

4trihelix1019.7e-3210271112187
                  trihelix    1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfd 83  
                                rW+k+e+ aLi++r+ ++ +++++  k+plWee+s+ mr  g++r++k+Ckekwen+nk+ykk+ke++kkr +e+s+tcpyf+
  Peaxi162Scf01294g00224.1 1027 RWPKEEIEALIRLRTCLDLKYQENVPKGPLWEEISAGMRNLGYNRNAKRCKEKWENINKYYKKVKESNKKR-PEDSKTCPYFH 1108
                                8*********************************************************************8.99********* PP

                  trihelix   84 qlea 87  
                                qlea
  Peaxi162Scf01294g00224.1 1109 QLEA 1112
                                **85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500907.1261125IPR017877Myb-like domain
SMARTSM007170.002365127IPR001005SANT/Myb domain
CDDcd122036.32E-2567132No hitNo description
PfamPF138372.7E-1867152No hitNo description
PROSITE profilePS500907.457459523IPR017877Myb-like domain
SMARTSM007170.0015463525IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.609.1E-4464522IPR009057Homeodomain-like
CDDcd122031.38E-28465530No hitNo description
PfamPF138376.8E-22465552No hitNo description
PfamPF138372.3E-7664713No hitNo description
PROSITE profilePS500907.45710201084IPR017877Myb-like domain
SMARTSM007170.001510241086IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.609.1E-410251083IPR009057Homeodomain-like
CDDcd122031.38E-2810261091No hitNo description
PfamPF138376.8E-2210261113No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1220 aa     Download sequence    Send to blast
MLGVSSGLIN TTSADGATST QAAPQQPPLA ATVAESGGSS EGGGGGGGDM TVGGGFNEEG  60
ERNSGGNRWP RQETLALLRI RSEMDVVFRD SSLKGPLWEE VSRKLADLGY HRSAKKCKEK  120
FENVYKYHRR TKDGRASKSD GKTYRFFEAL EAFETNPSHS LPPPPLAATP ITMAMPMRGN  180
ATMNNPPMPM SQTTVSSTQI PFTISQNTVP TAATPIVNHP LNNNVSPLPV SSQPSQPPPP  240
AQPNITLNQI NRAQGNTSFL SNSTSSSSTS SDEDIQRRHG KKRKWKDFFE GLMKDVIEKQ  300
EELQKKFLET LEKRERERLV REETWRVQEM ARMNREHDLL VQERSMAAAK DATIIAFLQK  360
ITGQQNIQIP NITTNTSPTQ IQLQEKPLVA PPQSQPQKQT QTPPPPPATA VSLPMPVPAP  420
AIAVSLPMPV PAPLPTQALV PVPARTMEVA KLDNGGENFT QASSSRWPKE EIEALIRLRT  480
CLDLKYQENV PKGPLWEEIS AGMRNLGYNR NAKRCKEKWE NINKYYKKVK ESNKKRPEDS  540
KTCPYFHQLE ALYKEKAKTE VVPITTTAFG LKPENNPSMV PVMAQPEQQW QPLGLDQQQV  600
HHQALNNMPD HDNESDSMDE DEEEDDDDDD EGNVYEIVTN KQPSSMAAST TTAAATATKF  660
LLVRKLADLG YHRSAKKCKE KFENVYKYHR RTKDGRASKS DGKTYRFFEA LEAFETNPSH  720
SLPPPPLAAT PITMAMPMRG NATMNNPPMP MSQTTVSSTQ IPFTISQNTV PTAATPIVNH  780
PLNNNVSPLP VSSQPSQPPP PAQPNITLNQ INRAQGNTSF LSNSTSSSST SSDEDIQRRH  840
GKKRKWKDFF EGLMKDVIEK QEELQKKFLE TLEKRERERL VREETWRVQE MARMNREHDL  900
LVQERSMAAA KDATIIAFLQ KITGQQNIQI PNITTNTSPT QIQLQEKPLV APPQSQPQKQ  960
TQTPPPPPAT AVSLPMPVPA PAIAVSLPMP VPAPLPTQAL VPVPARTMEV AKLDNGGENF  1020
TQASSSRWPK EEIEALIRLR TCLDLKYQEN VPKGPLWEEI SAGMRNLGYN RNAKRCKEKW  1080
ENINKYYKKV KESNKKRPED SKTCPYFHQL EALYKEKAKT EVVPITTTAF GLKPENNPSM  1140
VPVMAQPEQQ WQPLGLDQQQ VHHQALNNMP DHDNESDSMD EDEEEDDDDD DEGNVYEIVT  1200
NKQPSSMAAS TTTAAATATV
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1836844QRRHGKKRK
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00244DAPTransfer from AT1G76890Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9754437e-44HG975443.1 Solanum pennellii chromosome ch04, complete genome.
GenBankHG9755167e-44HG975516.1 Solanum lycopersicum chromosome ch04, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_009594163.10.0PREDICTED: trihelix transcription factor GT-2-like
TrEMBLA0A1S4DN330.0A0A1S4DN33_TOBAC; trihelix transcription factor GT-2-like
STRINGXP_009594163.10.0(Nicotiana tomentosiformis)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA24022459
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.25e-55Trihelix family protein