PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG66332.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1154 aa    MW: 126382 Da    pI: 4.8235
Description Trihelix family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
GBG66332.1     genome  NCBI    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  33.3   1.3e-10  789    857  2          63
    trihelix   2 WtkqevlaLiearremeerlrrg.......klkkplWeevskkmrergferspkqCkekwenlnkrykk 63 
                 W+ +   aLi+a+r+ + +l+         k ++ +W +v +++++ g+ r+++qC +kw+nl +++kk
  GBG66332.1 789 WSVDDMIALIRAKRDQDAHLQGMgtafacmKPREWKWLDVEQRLKKVGVDREAEQCGKKWDNLMQQFKK 857
                 99999*********9999998532223333688889999*****************************9 PP

Sequence
Protein Sequence    Length: 1154 aa
MEMSGTSNVE VITEQQGEAM TANGLVPSTT PSLEFPPPPP PPFSHPCNLS PIIAYTHQAG  60
TIDQCTSDSE AGATYPSCHQ QQQQAEFPAL APSGTLSSAA PVSAASSPSW TPSSGPFATC  120
MVGVVTSSYA SHNPLSPSPA FSHIRPLLRG CYTSATRPMM CPPHQRVGGT QFLDGSWIFK  180
SSGNRVTTTA LTVPVHGESE VGPHPHSLLV NPQSGAGSRK ATRMIRGRTD AMQIQKAALK  240
TAKMLIINEE NSHQRKRLAE GKDDAGVDLK SRRPGDAYEQ QRTEEDEDDE EEDDEDEEGE  300
EEVGDEGEEE EDEQQGDECS MGEDDMDADE DGGDDKEEEE EEIDVEETET GEEEEEQDEE  360
EEEEEEEEEQ EEDGEEKREG EENAKYKMTV DTTKIGVNAR TPTLANRSGM TPPHPEGHNP  420
KRRKKKRRRT ADNNMMIMMT GPTPITGRQQ VPSSPSWPRC RRPRAEARRQ KATTTRTRDN  480
FGWTAGIVRS LLRLRKEVRE DLGIDYGHHP SNVSVWKDIA VRMYQKHPET ARLDAMQFKN  540
KFNNLRSTCA SYTEVLRQGL SGDEGDGSIN LSFGLCSGRS SAASRTVIVD NHPDDEGGQV  600
TAVVQSLKSP PPVWEASGNN RDPPRQQYRT PSVSRGASTR PLWMQSPSPL SASWTAARRR  660
GECGETDCVI NDVGDAREGR EVWAEQRRTL HPQREESITW GMQRLRVGDH ENDGDAPAVG  720
ADDQEWNDDD GEGGEEDAGH VSPFKQSSMG GRGGKTRACA LNGRRGKKAV GKGSDAEADA  780
DAEGGRQFWS VDDMIALIRA KRDQDAHLQG MGTAFACMKP REWKWLDVEQ RLKKVGVDRE  840
AEQCGKKWDN LMQQFKKRLW KGFNFNMDRA VYDEILGSTA KSHTINLKNV ADTGSPGGVR  900
LPSATSADHE SVGDGDAVAG HDDDDSDGGS TRGSSQTMAS SAGFGKRKST RQQTFEALTE  960
CMEKHGALVA SMMESNNKRQ CSIQIRQCEA GDVDPILTTC FAIDVEILTS RVVKLEVGLS  1020
HVMSSRGSGR GKSAGKLAVE AAIREKKGRH VANKHRMLVE GGSQLARNVE DEEWVAEEAA  1080
SQVESDFEEE EEVSLKRKSS RRGSGALRIE DVGERRGGGG RALMEDVIDV NAAATSRRGG  1140
GTAVTQQHRA QCHV
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    421    427  KRRKKKR
2    421    429  KRRKKKRRR
3    422    428  KRRKKKR
4    422    427  RRKKKR
5    423    429  KRRKKKR
6    424    429  KKKRRR
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G35640.1  6e-06    Trihelix family protein