PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG73755.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1111aa    MW: 119503 Da    PI: 9.7915
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG73755.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix37.75.4e-12347421269
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegek 69 
                 W+ +   aLi+a+r+ + +l+       r k ++ +W +v +++++ g+er++++C +kw+nl +++kk+ + ++
  GBG73755.1 347 WSIDDMVALIRAKRDQGAHLQgmgtayaRMKPRELKWLDVEQRLKKVGVEREAERCGKKWDNLMQQFKKVHRFQG 421
                 9999999*******66666653333322679**************************************987665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1111 aa     Download sequence    
MADVFRRLPL LVLVVLLLVL VVLFHVCVVG IVPQRRRRRR CAKKDVSMDG RQAVCRSAGG  60
EGGGQRAGQS PSRGSRAVER SEYPHLPPHL QPLPDTSNEE EDNRRSRAVP LGSGSTQEWT  120
STELCGSREP SYGHGRLSSS TRTVLVDNRP DDDGAQVTAV ARSSKSPASS VRVASGNNRD  180
PRTQQYRASS ASRGASARPS WMLSPSPLSG NSSAARNRGE CGEHDCGIEE RGDRRDGREV  240
WEEQRRMLHP RRQESITRGV QRLRVVDDEN DGDTPDVGGN DQDLNDDDCG GGEDDAGQVS  300
PSKQSSMGGK GGRVKVSVRN GRRGKKAMGK GSVVEADGDA EGDRHFWSID DMVALIRAKR  360
DQGAHLQGMG TAYARMKPRE LKWLDVEQRL KKVGVEREAE RCGKKWDNLM QQFKKVHRFQ  420
GLSGKQDFFQ FTGKERLSKG FSFNMDHTGA PGGVRLPSAN SGDPESVGDR GAAVGLDDDG  480
DGSTRGSSQT TGGPTGFGKR KSTRQQTFDA LSECMEKHGA LVVSTMESNS KRKCSIQIRQ  540
CEALEAEVEV QKKHYAASDK VSKLMCHALL EIAKAIREHV PSPSTPSSDC ANVFPCAFVA  600
FFAVIICVFD TDVPHSNLVA FVTICAVTLS KDFDVVLYSL IIATLLGGET GRHSSHHPCV  660
SWRVCRAPWH PEVLDAGGLP GNRRSTRLSG IKGAATWRTN RESSSKGPLR YVESDFAESE  720
EIPMKRKSSR RQTGALRIDD VGERHSAGGR GANEQDVNVA ARLATGRAGG SAAMQQNRAP  780
LPRVSEPPAA QEVFVRGRTP STPRQTTATK VGVAAQGIGQ GIASRSPAHD ERGASTAVAA  840
RATEMAKAGA ATAGGASQAA EGGRSVGAAT VGASHAVEGA RAGGGNAGEG SRAGAVAGVG  900
EDDEALANRV RQRSAREGID VASKLWVDDI HFWNGTEGNA IVKLIAYIED RHEDDEQAAY  960
QEASVQRLVG AFTSAVSTAE GVDGGRVSHE RLKSVAEAMQ VMLAATMWLM RMGGGDRRAH  1020
YNTWVFVQLT AKPTLIASMH RSFDARRHIV QAATAITDKL AKPPITLLAP PLYIPDWVSI  1080
GVKFLARRHF VFPNGGSKAP LVGHRATGRR R
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
13440QRRRRRR
23540RRRRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.12e-08Trihelix family protein