PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG71213.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1097aa    MW: 119224 Da    PI: 7.1254
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG71213.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix39.61.3e-12352427270
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekk 70 
                 W+ + + aLi+a+r+ + +l+       r k ++ +W  v+ ++++ g+ r++ +C +kw+nl +++kk+ + ++ 
  GBG71213.1 352 WSVDDIIALIRAKRDQDAHLQgmghvyaRMKPREWKWLNVATRLKKVGVDREADRCGKKWDNLMQQFKKVHHFQGL 427
                 99************9998888544444456899999*********************************9887665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1097 aa     Download sequence    
MABVFRRLPL LLLVVLLLLL LVVFHVCILG GLPRRGRRRR CVKNNDRMEG QKAISGSAGE  60
CGGRWSGNSP SNSPRPSEQS GHAHLPPHLQ PLPDTDDEEG DDRRSLTVPL GSGSTQEWVA  120
TELFGSRDGG KGQSYTELLQ QGISDTDGDG GVNLSFGLGS GRSAVVSRTV VVNPHPHDDG  180
SDVTAVQRSP RSPAPLREAS GNNNDPPRQQ FRSPSVCRGR RVGECRETAP AVADVGDARD  240
GREVWAEQRQ LMRSVREESI TRGVQRLRVG EDGHDGEDAG ADAHDPDWND NGAEGWEDDA  300
ANISPSKQAA AMGGRGGKTK SCSGNGRRGK RTAGKGSDAE GDVDGEGGRH FWSVDDIIAL  360
IRAKRDQDAH LQGMGHVYAR MKPREWKWLN VATRLKKVGV DREADRCGKK WDNLMQQFKK  420
VHHFQGLSGK QDFFQLSGKD RMSKGLKHDD DDDGSTKGSS QTTGGTGGFG KRKSTRQQTF  480
EAMTECMEKH GALMASTMES NSKRHCSIAI RQCEALEAEI EVQKKHYAAS DEILAFVVQI  540
VIVTAGASCA MSSRASTRGK AAANLSQQPA TREKKGRHMA SKKRKILEGA PAHGGYVLDE  600
EWVPKDVATQ EGTDFEDSDD MPLQRKSSRR GSGGIRIDDA GERRRAGGRP VPEDVVDVDA  660
ATTAKEGGGN RAPLPRMTPP TDTQEGVVCL RTPLTPRSRT AADVALSGGP SQAAGGVRAG  720
GAAGYAAKGG ESNAGEGATA GVKASDVAGE GDDDDPSVNR LRQQNTREME AAAKLWVDDL  780
RFWNEREGFA IVKLIAEARG YLVVVARGEQ PPPIRRSIVL PHNSIPQHKI TDESELNVAK  840
ERALKVQGIA LRVIHGWVFK SQNMQRGYHA AYQYALNHAA TDIARAMWMG EDWRYCVSPM  900
VIHHTLDMHM KLPLWFVGAD VEDRHEDDGL AAYQEASIQR LVGDFTSAVI IAEATDGDRV  960
SHERLKTMAD AMRMMLAATM WLMRMAGDDH RAHYNAWVFV QLTAKPTLVA SMHRCFDARR  1020
HIVQAVTVIT DKLASPPITL IDPPMYVPDW ASIGVKFSHD ATLSSPMEAK KVDWLGTGPP  1080
EDEDDGKGDE QGSGGGR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.16e-07Trihelix family protein