PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG61087.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 855aa    MW: 91829.2 Da    PI: 5.6224
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG61087.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix29.52e-09207283270
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergf.erspkqCkekwenlnkrykkikegekk 70 
                 Wt ++  aLi a+r+ +++l        r k k  +W++v  ++ + g+ +r+++ C +kw+nl +++k + + +++
  GBG61087.1 207 WTVEHMIALIPAKRDQDSHLAglahttgRMKMKTWKWDDVETRLVQMGVtSRKAVDCGKKWDNLYQQFKTVHKFMGE 283
                 **************9888888555543356799999********99999899*****************99876665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 855 aa     Download sequence    
MGSQLHRQAS TPTYTDLLEG RTPAGYDAGL VDLSFGLRSG SAEEVTCTVI VNPASGSTRT  60
PAPVTTRTSG SVPCRTTTAG GTADGRRPNE EWSATEVVGR KFWDDHRRQS HEASTTGITR  120
GVAKITVGAD DILGDEDGAV AEECEADDGA GDDDEDDDEE MEIRPVGRKR GGSRAGKKLS  180
ETPTGRRGKK GVEDGSAGEG SKSQDFWTVE HMIALIPAKR DQDSHLAGLA HTTGRMKMKT  240
WKWDDVETRL VQMGVTSRKA VDCGKKWDNL YQQFKTVHKF MGESGKPNFF TLTPGERKER  300
GFNFLMDERV YSEMAAMTRS DHTIHPTNLA NTGATGGVQL SGPRGGRNES GGSEGGGDGQ  360
DDDQGSTRDS ISGGGVGGGS KRKNVRQQTF DTIADVMKEH GSLMATTVDS ASKRQCSILT  420
RQCDILEREV EVQKEHYVKA DQANLMMCEA LMEIAKAIRE RSKDGGGGDG GETNKGRGHI  480
PKSKRQRIDD ASSSQTDDFL ADEVAMVDAQ GTAGVARLGF GRDGVAREQL QALKCSVVGG  540
AARTVPRTPN TAGVVVTGAW VAGQLSPAVG QGQPRQSLPL QHVLASGGGH MATAQKGDAT  600
ASASRAAAAD HTTKGGVVEG AERGGVEDIG RDDGRRDGKR EGNDYDDRPL LPRGKGAPKE  660
DDLEEKAKLW VDCDAFWGQG PGKPLREAVG ECTDYFVAIA NGDAGVEPSS MLIMPPNDVP  720
RFKIDDPAQR DPALXRARSV ERVVLXTIHG WIFKSQSRST GFSRAESYIT VDFATDLARA  780
VWQALEWXRV VSPALVYHTL AMKMDVPLWF AGVKIEDRPE DDDIAARQEA TILLLAECRT  840
DAIWCGQRAD GGRVK
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1482487KSKRQR