PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG84385.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1311aa    MW: 137610 Da    PI: 6.3176
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
GBG84385.1       genome    NCBI      View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    27.8     6.5e-09    503      599    2            79
    trihelix   2 WtkqevlaLiearremeerlrrgk....................lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstc 79 
                 W + e+++Li++   ++++ r g                        + W+++ ++m+++gf+rs  qC+ +w+nln+  +++ + ++++ ++++s +
  GBG84385.1 503 WGDYETMVLIRLLLGDKQTGRVGFlvnprsnwggggvfggekerTSNATWRAIEQAMKSEGFKRSWRQCRTRWRNLNRWSQRVIKYDSRQ-NGRRSYW 599
                 999999999999998888888653333444566666666666668899*****************************9888877777764.4554455 PP
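The Start and End columns of the signature-domain table appear to be 1-based, inclusive positions on the protein, and this can be checked against the aligned row itself. The sketch below is only an illustration and not part of PlantTFDB: the helper name `ungap_alignment_row` is hypothetical, and it assumes HMMER-style alignment output in which `-` marks a deletion and lowercase letters mark residues inserted relative to the model.

```python
def ungap_alignment_row(row: str) -> str:
    """Return the residues of an aligned sequence row, without gap characters."""
    # '-' and '.' are gap symbols (assumed HMMER-style); lowercase letters are
    # insert-state residues that still belong to the protein, so they are
    # kept and upper-cased.
    return "".join(ch.upper() for ch in row if ch.isalpha())


# Aligned row for GBG84385.1, copied from the alignment.
aligned = (
    "WGDYETMVLIRLLLGDKQTGRVGFlvnprsnwggggvfggekerTSNATWRAIEQAMKSEGFKRSWRQ"
    "CRTRWRNLNRWSQRVIKYDSRQ-NGRRSYW"
)
start, end = 503, 599  # Start / End columns of the signature-domain table

domain = ungap_alignment_row(aligned)
# For 1-based inclusive coordinates the span length is end - start + 1.
span_matches = len(domain) == end - start + 1
print(len(domain), span_matches)
```

If `span_matches` is true, the ungapped row accounts for every residue between Start and End, which supports reading the coordinates as inclusive.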

Sequence
Protein Sequence    Length: 1311 aa
MGVPVMLFDG NVGDDAWERL RKELRMRAMR LDILLRKRQQ PPTRQRSTRE QRLTSQRPTS  60
QEATRQQQRW GPRLWTDAAC VGVRTTRGPA AAANATTPYY VANESMIKGV DGERTRGSSS  120
SDVAVGGDLC TPGGSDVGGG GLFAAAGGSS KIVFVPARGS DVGRGTGGLI DLRTRGSSGG  180
GLCTPGGGDV GGGLFAAGAR KIVFVPARGP DDGRGTCGIV DLRTRGSDVE NVCTRGRNVD  240
GGTRGSDGGE VRTRGSDLAV GPRGRPRGSD LAGRKRGSSV FFDGRTRGSG SLAAEAQRPI  300
SEEEEEEEEE EEEEEEEEEG GTRDALSNKD KEKEKEKEKD KGGSEGRVRR GTCVLPRVHG  360
QGKKGEVGMA VRGEGGGGEG GGRVLVVSGE EERDGRRVVR GKVGGDACRD DLASSHHNQG  420
MNTASRVCHV AAAGGVDEMA RDGVVMSREA VDCSSPSSSR RRMIGRRGEE RQVEGDSAAA  480
VAAAGGAEGG PRGRLRRRSR NNWGDYETMV LIRLLLGDKQ TGRVGFLVNP RSNWGGGGVF  540
GGEKERTSNA TWRAIEQAMK SEGFKRSWRQ CRTRWRNLNR WSQRVIKYDS RQNGRRSYWK  600
MNPSQRCQSN LPFNMRRNWF EAIGAARQVY RAAAAAEDDS RRFGVVGGHH HHHPHHQLAG  660
PGVGHVSGSA PDNRDPNNLS GSGHVSGSPA DKQDEHLSGS RHVSGYHNLA GPGVCHVSGS  720
APDSRDPNNL SGSGHVSGSP ADKQDDHLSG SCHVAGSRVD KQDCNNLSGN QPDTQPHNEP  780
EIGHVSGSGA NSQHHQNVSG IGHVSGGSRQ ERHDHNNISG SRSDRQNQNV SEIGHVSGSG  840
ADRQDEKKNV VHAGGVGVEE EERGGHAHRN GGEESSSDEG DLDGVRRGVS SDSGHHNAEG  900
SGDGRVSGSR GCAHQKEEGS SGDGHVSGNL LVCAHHSGAR SSAGNGHSSG DISDDDNITG  960
DDHVSDDDHV AGDDHVAGDD HVAVRQLHSQ HHNTLAEVEE GVSRGCAHEN REGFSGDGHV  1020
SGSLARTHQS PGKSDDVNLS GGDRALKVSG DGAPRISPSR EERHVSSGDG GACHVSSEDE  1080
GACHVSSGDG GACHVLLLSG DEEARQCDGK GSSSGRETRG RSSSPRPDPD TGHVSGSPAD  1140
GFSDDDCDYS SSMSFTDSDR SGPGPGPGHD YDDLSSPSPK RRRLTASEED GNATANAIPM  1200
EMEMEVEVEK GSSVGVLMTE MTSEGGGGGG EGGGGGGGGG GGGGEGGGGI LDECYLEQFP  1260
AEKAWEEVRD AMREDIALQR ETLGMLASAI KTGILGFLAK ELESSKGLHL T
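The sequence is printed in blocks of 10 residues, 60 per row, with a running residue count at the end of each full row. A minimal sketch of joining such rows back into one contiguous string follows. The helper `join_blocked_rows` and its `offset` parameter are hypothetical names introduced for illustration, and only two rows of the record (residues 481 to 600) are embedded to keep the example short.

```python
def join_blocked_rows(rows, offset=0):
    """Join 10-residue blocks into one string, checking each running count.

    `offset` is the number of residues that precede the first row supplied.
    """
    sequence = ""
    for row in rows:
        for token in row.split():
            if token.isdigit():
                # A trailing number is the index of the row's last residue.
                if int(token) != offset + len(sequence):
                    raise ValueError(f"running count {token} does not match")
            else:
                sequence += token
    return sequence


# Two rows copied from the record; the trailing numbers are running counts.
rows = [
    "VAAAGGAEGG PRGRLRRRSR NNWGDYETMV LIRLLLGDKQ TGRVGFLVNP RSNWGGGGVF  540",
    "GGEKERTSNA TWRAIEQAMK SEGFKRSWRQ CRTRWRNLNR WSQRVIKYDS RQNGRRSYWK  600",
]
segment = join_blocked_rows(rows, offset=480)
print(len(segment))
```

Validating the running counts while joining is a cheap way to notice a dropped or duplicated block when the text has been copied out of a web page.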
Nuclear Localization Signal
NLS
No.    Start    End     Sequence
1      1180     1184    KRRRL
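The NLS coordinates can be cross-checked against the protein sequence. The sketch below assumes the Start and End values are 1-based and inclusive; it embeds only the sequence row that covers residues 1141 to 1200, with its blocks joined, and the variable names are illustrative.

```python
import re

# Row covering residues 1141-1200 of the protein sequence, blocks joined.
row = "GFSDDDCDYSSSMSFTDSDRSGPGPGPGHDYDDLSSPSPKRRRLTASEEDGNATANAIPM"
offset = 1140  # residues that precede this row

match = re.search("KRRRL", row)
# re reports 0-based, end-exclusive offsets; the table is assumed to use
# 1-based, inclusive positions, so only the start needs the +1 shift.
nls_start = offset + match.start() + 1
nls_end = offset + match.end()
print(nls_start, nls_end)
```

The same conversion applies to any motif located by string search: add the row offset, then shift the start by one and leave the end-exclusive index unchanged.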