PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Niben101Scf02361g07001.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae; Nicotiana
Family Trihelix
Protein Properties Length: 654aa    MW: 72133.9 Da    PI: 7.9977
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Niben101Scf02361g07001.1genomeBTI-
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix92.93.3e-2962145186
                  trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdql 85 
                               rW++qe+laL+++r+em+  +r+++lk+plWeevs+k+++ g++rs+k+Ckek+en+ k+++++k+g+ ++   + +t+++fdql
  Niben101Scf02361g07001.1  62 RWPRQETLALLRIRSEMDVVFRDSSLKGPLWEEVSRKLADLGYHRSAKKCKEKFENVYKYHRRTKDGRASK--ADGKTYRFFDQL 144
                               8********************************************************************96..66678******9 PP

                  trihelix  86 e 86 
                                
  Niben101Scf02361g07001.1 145 A 145
                               6 PP

2trihelix104.11e-32460545187
                  trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdql 85 
                               rW+k+e+ aLi++r++++ +++++  k+plWee+s+ mr+ g++r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ql
  Niben101Scf02361g07001.1 460 RWPKEEIEALIRLRTSLDLKYQDNGPKGPLWEEISAGMRKLGYNRNAKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFHQL 543
                               8*********************************************************************8.99*********** PP

                  trihelix  86 ea 87 
                               ea
  Niben101Scf02361g07001.1 544 EA 545
                               85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500907.1255119IPR017877Myb-like domain
SMARTSM007170.002359121IPR001005SANT/Myb domain
PfamPF138372.4E-1861146No hitNo description
CDDcd122031.79E-2461126No hitNo description
SMARTSM007172.6E-5457519IPR001005SANT/Myb domain
PfamPF138371.2E-22459546No hitNo description
CDDcd122031.09E-28459524No hitNo description
Gene3DG3DSA:1.10.10.609.3E-5459516IPR009057Homeodomain-like
PROSITE profilePS500907.689459517IPR017877Myb-like domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 654 aa     Download sequence    Send to blast
MLGVSSGLMT NTSCGGGAAS ISAPPPQEAP ESGGSSEGGG GGDVAAAGGF SEEGERNSGG  60
NRWPRQETLA LLRIRSEMDV VFRDSSLKGP LWEEVSRKLA DLGYHRSAKK CKEKFENVYK  120
YHRRTKDGRA SKADGKTYRF FDQLAAFENS TSHNSLPPPP LAATPLTMAM PMPVRPNPPI  180
PLSQSMAPPA QNTFVVSQNN VVTAASAVNH PLNVSSLPLS QPPPPPTQPI ITTVNQMNRP  240
QGNTSSLLPN STSSSSTSSD EDIQKRHGKK RKWKNFFERL TKDVIEKQEE LQKKFLETLE  300
KRETERMVRE ETWRVQEMTR MNREHDLLVQ ERSMAAAKDA TIIAFLQKIT EQKNTPIPNI  360
TNASLAQMQF QLSEKSPSGP PHSQPQKQTQ QLTPATPPPP ATASATASAT TPAPAIAVSL  420
PMPIHAQVQT QVPSLPVAKT FEAPKTDNGG ENLSPASSSR WPKEEIEALI RLRTSLDLKY  480
QDNGPKGPLW EEISAGMRKL GYNRNAKRCK EKWENINKYF KKVKESNKKR PEDSKTCPYF  540
HQLEALYKEK AKNEVVPNTG TGFGLKPENN NPMVPIMAEP EQQWPFRSNQ PQQQQQQQGI  600
MSNIIQDHDN ESDSMEEDDY DDDEDEGNAY EIVTNKQPSS MAAATVTAAA TTAV
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1263271QKRHGKKRK
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016514564.10.0PREDICTED: trihelix transcription factor GT-2-like
TrEMBLA0A1S4DN330.0A0A1S4DN33_TOBAC; trihelix transcription factor GT-2-like
STRINGXP_009594163.10.0(Nicotiana tomentosiformis)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA24022459
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.12e-62Trihelix family protein