PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Niben101Scf04731g03002.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae; Nicotiana
Family Trihelix
Protein Properties Length: 500aa    MW: 56859 Da    PI: 9.0063
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Niben101Scf04731g03002.1genomeBTI-
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix92.93.3e-2973156186
                  trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdql 85 
                               rW+++e+laL+++r++m+ ++r+++lk+plW+e+s+km e g++r++k+C+ek+en+ k++k++k+g+++r  ++ +++++f+ql
  Niben101Scf04731g03002.1  73 RWPHEETLALLKIRSQMDIAFRDSNLKGPLWDEISRKMGELGYNRNAKKCREKFENIYKYHKRTKDGRSGR--QTGKNYRFFEQL 155
                               8********************************************************************97..56668******9 PP

                  trihelix  86 e 86 
                               e
  Niben101Scf04731g03002.1 156 E 156
                               7 PP

2trihelix93.42.2e-29365450186
                  trihelix   1 rWtkqevlaLiearremeerlrr.gklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdq 84 
                               rW+k ev aLi++r++++ ++ + g+ k+plWe++s  m++ g+ r++k+Ckekwen+nk+y+++keg+k+r +e+s+tcpyf+ 
  Niben101Scf04731g03002.1 365 RWPKAEVEALIKLRTNVDLQYPDnGSPKGPLWEDISTGMKKLGYDRNAKRCKEKWENINKYYRRVKEGQKRR-PEDSKTCPYFHL 448
                               8*****************99996489********************************************97.9**********9 PP

                  trihelix  85 le 86 
                               l+
  Niben101Scf04731g03002.1 449 LD 450
                               87 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007171.1E-470132IPR001005SANT/Myb domain
PfamPF138373.5E-1972156No hitNo description
CDDcd122033.95E-2372137No hitNo description
PROSITE profilePS500907.29472130IPR017877Myb-like domain
Gene3DG3DSA:1.10.10.604.7E-473131IPR009057Homeodomain-like
SMARTSM007170.13362425IPR001005SANT/Myb domain
PfamPF138374.7E-22364452No hitNo description
CDDcd122031.28E-20365430No hitNo description
PROSITE profilePS500906.853365423IPR017877Myb-like domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 500 aa     Download sequence    Send to blast
MLETSVLLED SANAAAAGGA AVTQAVELRN DGGGGCGSVG GGSEEEERSR GELEGEKNNI  60
SGGNRWPISG GNRWPHEETL ALLKIRSQMD IAFRDSNLKG PLWDEISRKM GELGYNRNAK  120
KCREKFENIY KYHKRTKDGR SGRQTGKNYR FFEQLELLDN QINRMDTTTS ISMPVPMPMP  180
MTMIKPATSG CQDFSYRNQG FNPEFMSTST STTSSSGKEY GSVKKKRKLA GYFERLMKQV  240
LDKQEDLQNK FLEAIEKSER DRIAREEAWK VQEIARLKKE KEALANERAI SAAKDAAVIA  300
FLQKISEQTV QVQSPMELSH EKKTENSSVK TVESQENVLQ QENMLDKQDI IDSAGENSFH  360
MSSCRWPKAE VEALIKLRTN VDLQYPDNGS PKGPLWEDIS TGMKKLGYDR NAKRCKEKWE  420
NINKYYRRVK EGQKRRPEDS KTCPYFHLLD SIYQIKSKKQ LLSSENPGSS MKAGQLLMQI  480
MNQQQQQQLE TERQNVEQSQ
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1223228KKKRKL
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_009626647.10.0PREDICTED: trihelix transcription factor GT-2-like
TrEMBLA0A1S3Z0J00.0A0A1S3Z0J0_TOBAC; trihelix transcription factor GT-2-like
TrEMBLA0A1U7VX140.0A0A1U7VX14_NICSY; trihelix transcription factor GT-2-like
STRINGXP_009626647.10.0(Nicotiana tomentosiformis)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA97942328
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.14e-69Trihelix family protein