PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Niben101Scf02902g03009.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae; Nicotiana
Family Trihelix
Protein Properties Length: 1258aa    MW: 139104 Da    PI: 9.1827
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Niben101Scf02902g03009.1genomeBTI-
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix91.77.6e-29132216187
                  trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdql 85 
                               rW++qe+ aL+++r+em+  +r+++lk+plWeevs+km++ gf+rs+k+Ckek+en+ k+++++k+g+ ++   + +t+++f+ql
  Niben101Scf02902g03009.1 132 RWPRQETIALLKIRSEMDLVFRDSSLKGPLWEEVSRKMADLGFHRSAKKCKEKFENVYKYHRRTKDGRASK--ADGKTYRFFEQL 214
                               8********************************************************************96..66678******* PP

                  trihelix  86 ea 87 
                               ea
  Niben101Scf02902g03009.1 215 EA 216
                               85 PP

2trihelix92.44.5e-29634718187
                  trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdql 85 
                               rW++qe+ aL+++r+em+  +r+++lk+plWeevs+km++ gf+rs+k+Ckek+en+ k++k++k+g+ ++   + +t+++f+ql
  Niben101Scf02902g03009.1 634 RWPRQETIALLKIRSEMDLVFRDSSLKGPLWEEVSRKMADLGFHRSAKKCKEKFENVYKYHKRTKDGRASK--ADGKTYRFFEQL 716
                               8********************************************************************96..66678******* PP

                  trihelix  86 ea 87 
                               ea
  Niben101Scf02902g03009.1 717 EA 718
                               85 PP

3trihelix102.14.3e-3210531138187
                  trihelix    1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfd 83  
                                rW+k ev aLi +r++++ +++++  k+plWee+s+ m++ g++r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+
  Niben101Scf02902g03009.1 1053 RWPKAEVEALIMLRTQLDVKYQENGPKGPLWEEISAGMKKLGYNRNAKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFH 1134
                                8*********************************************************************8.99********* PP

                  trihelix   84 qlea 87  
                                ql+a
  Niben101Scf02902g03009.1 1135 QLDA 1138
                                **85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.0052129191IPR001005SANT/Myb domain
PROSITE profilePS500906.888131189IPR017877Myb-like domain
PfamPF138376.7E-19131217No hitNo description
CDDcd122033.91E-26131196No hitNo description
SMARTSM007172.2E-5552693IPR001005SANT/Myb domain
PROSITE profilePS500906.888633691IPR017877Myb-like domain
CDDcd122032.17E-26633698No hitNo description
PfamPF138374.5E-19633719No hitNo description
PROSITE profilePS500907.25910461110IPR017877Myb-like domain
SMARTSM007172.6E-410501112IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.609.2E-410521109IPR009057Homeodomain-like
PfamPF138371.4E-2210521139No hitNo description
CDDcd122034.06E-2910521117No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1258 aa     Download sequence    Send to blast
MTIIEEKSED SVAKRSKSCN CCTEEKEKEK EKEKRRRRRG LIRSRNIVAV LLSLLFRKTK  60
REKKKKMLGV SGIISSTNSG GGGENPESGG GAASGGSSEI AIGGGGISFM TEEDRAIRNM  120
EEGERNSGGG NRWPRQETIA LLKIRSEMDL VFRDSSLKGP LWEEVSRKMA DLGFHRSAKK  180
CKEKFENVYK YHRRTKDGRA SKADGKTYRF FEQLEALENN PSSHHSLLLP PPITSSRPPP  240
PPLEATPINM AMPMPSGNGN TINLQLPASQ QQAAATTTTV TVSSAPPNNI FVSSHQNTIS  300
HQNIPLSSSM APSSQPLPQP ANNPINNLQA TTNFPSRQNI SAMSYSTSSS TSSDEDIQRR  360
HKKKRKWKDF FERLMKDVID KQEDLQRRFL ETLEKRERDR MVREEAWRVQ EVARMNREHD  420
LLVQERSMAA AKDAAVVSFL QKITEQQNIQ IPSNINVVPP SAQVQIQLPE NPPPPPATRS  480
QVVQQQTQPT AVPVSPAPPQ PSPVIPAPIS LPVTKPAPVP VQSLPLTPPV PAKNVELTPK  540
SDNGGEGCTP ASSSRWPKAE VEALIKLRTQ LDVNTNSGGG GENPESGGGA ASGGSSEIAI  600
GGGGVSISGG FMTEEDRAIR NMEEGERNSG GGNRWPRQET IALLKIRSEM DLVFRDSSLK  660
GPLWEEVSRK MADLGFHRSA KKCKEKFENV YKYHKRTKDG RASKADGKTY RFFEQLEALE  720
NNPSSHHSLL LPPPITSSRP PPPPLEATPI NMAMPMPSGN ANTINLQLPA SQQATTTVTV  780
SSAPPNNSSN IFVSSHQNTI SHQNIPLSSS MAPSSQPSPQ PANNPINNLQ ANTNFPSHDQ  840
NISAMSYSTS SSTSSDEDIQ RRHKKKRKWK DFFERLMKDV IDKQEDLQRR FLETLEKRER  900
DRMVREEAWR VQEVARMNRE HDLLVQERST AAAKDAAVIS FLQKITEQQN IQIPSNINVA  960
PPSAQVQIQL PENPPPPSET RSQVVHQQTQ PTAVLVSPAP TQPSPATPAP ISLPATIPAQ  1020
SLPLTPPVPA KSVELTPKSD NGGEGYSPAS SSRWPKAEVE ALIMLRTQLD VKYQENGPKG  1080
PLWEEISAGM KKLGYNRNAK RCKEKWENIN KYFKKVKESN KKRPEDSKTC PYFHQLDALY  1140
KEKAKNETSS SSFSNPSASG FALNPENNPM MPIMASPEQQ WPLPPHHQQH HESTRMDHDH  1200
ESDNMDQDED DEDNEDEDEE NAYEIVANKQ QSSMAAANTT TTTTATATSW IKNGLTGG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
12938KEKEKRRRRR
23338KRRRRR
3859866QRRHKKKR
4859867QRRHKKKRK
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00244DAPTransfer from AT1G76890Download
Motif logo
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_009618661.10.0PREDICTED: trihelix transcription factor GTL1-like
TrEMBLA0A1S3XDN40.0A0A1S3XDN4_TOBAC; trihelix transcription factor GTL1-like
STRINGXP_009618661.10.0(Nicotiana tomentosiformis)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA24022459
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.24e-79Trihelix family protein