PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG71252.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 898 aa    MW: 98134 Da    pI: 7.1658
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
GBG71252.1       genome    NCBI      View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    29.4     2e-09      203      279    2            70
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergf.erspkqCkekwenlnkrykkikegekk 70 
                 Wt ++  aLi+a+r+ +++l        r k k  +W++v k++ + g+ +r+ + C +kw+nl +++k + + +++
  GBG71252.1 203 WTVEHMIALIRAKRDQDSHLVglahttgRMKTKTWKWDDVEKRLVHMGVtSRKVVDCGKKWDNLYQQFKTVHKFMGE 279
                 **************777766644443336789999*********99999799*****************99876665 PP
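
The coordinates in the domain table can be cross-checked against the alignment rows. The sketch below assumes the alignment follows the usual HMMER-style convention, where a "." in the model row marks a column with no model position and lowercase letters in the protein row mark residues inserted relative to the model. That convention is an assumption about this page, not something it states; the helper name `aligned_span` is made up for illustration.

```python
def aligned_span(aligned: str, gap_chars: str = ".-") -> int:
    """Count the positions an alignment row covers, ignoring gap characters."""
    return sum(1 for ch in aligned if ch not in gap_chars)


# Rows copied from the alignment above (model row, then protein row).
hmm_row = "WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergf.erspkqCkekwenlnkrykkikegekk"
seq_row = "WTVEHMIALIRAKRDQDSHLVglahttgRMKTKTWKWDDVEKRLVHMGVtSRKVVDCGKKWDNLYQQFKTVHKFMGE"

# Coordinates from the Signature Domain table (1-based, inclusive).
hmm_start, hmm_end = 2, 70
seq_start, seq_end = 203, 279

# Both rows must have the same number of alignment columns.
assert len(hmm_row) == len(seq_row)

# Each row, with gaps removed, should span exactly End - Start + 1 positions.
assert aligned_span(hmm_row) == hmm_end - hmm_start + 1
assert aligned_span(seq_row) == seq_end - seq_start + 1

# Lowercase protein residues (assumed insertions) should match the "." columns.
inserted = sum(1 for ch in seq_row if ch.islower())
assert inserted == hmm_row.count(".")
```

If any assertion fails for another record, the table row and the alignment were probably taken from different hits.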

Sequence
Protein Sequence    Length: 898 aa
MGSQLHRQAS TTTYSDLLEG RAPAGYDAGL VDLSFGLRSG SAEEATRTVI VNPALGTTHT  60
PPQVTTRTGR SVPCRTTTAG GTVDGRRPNE EWSATXVVGR KFWDDHRRQS REASTAGITR  120
GVAKISVGAD DILRDEDGGV AEDCEADDGA GNDDEDMEIR PLGRKRGGLT AAKKLPEMRT  180
GRRGKKGVED ASPDEGSKSR DFWTVEHMIA LIRAKRDQDS HLVGLAHTTG RMKTKTWKWD  240
DVEKRLVHMG VTSRKVVDCG KKWDNLYQQF KTVHKFMGES GKPNFFTLTP GERKERGLDF  300
RMDECVYSKM AAMTTSDHTI HPTNLADTGA TGGVQMSAPR GGRNESGGSE GSGDGQDDDQ  360
GSTKDSISGG GVGGGSKRKN VRQQTFDMIA DVMKEHGSLI ASTVDCASKG QCSILTRQCD  420
ILEREVEVQK EHYFKADQAN LMMCEALMEI AKAIRKRSKD GGVGDGGETN KGRGHIPKSK  480
RQRIDDASSS QTEDFLADEV AMVDAQGTTG VVRLGFGRDG VSREQLQAAR RSVVGGAAGT  540
VPRSPNTARV VVTAARVAGQ ISAAAGQDGK REGNDGDDRP LVSRGKGAPK EDELQEKAKL  600
WVDCDAFWGE GPVKPLREAV GECTDYFVTV ANGDARAEPP SMLIMPPNDV PRFKIDDPAQ  660
REPALRRARS VERVVLRTIH GWIFKSQSRW TGFSRAESYI TVDFAMDLAR AVWQALEWSR  720
VVSPALVYHT VAMKMDVPLW FAGLKIEDRP EDDNMAARQE ATVLLLAECW TDALWCGQWA  780
DDGRVKQDRL SRLADCLRAL LCAVMWIMRM GGDDNRSHYE AWSYASMVAK PTMIGAGSYI  840
FNWRQHVVDS ANLVLHRLGK AHLTLGDYPQ CIPEWCDCGL AFGHNAALKN AAEAAKHG
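
The listing above groups residues in tens and ends most rows with a running position counter, so it has to be cleaned before use in other tools. A minimal sketch, assuming the layout shown here (letters are residues, everything else is formatting); `clean_sequence_block` is a hypothetical helper name, and only the first two rows are included to keep the example short.

```python
import re


def clean_sequence_block(block: str) -> str:
    """Join a numbered, space-grouped protein listing into one residue string."""
    rows = []
    for line in block.splitlines():
        # Keep letters only: this drops the grouping spaces and the
        # trailing position counter (for example "60") in one step.
        rows.append(re.sub(r"[^A-Za-z]", "", line))
    return "".join(rows).upper()


# First two rows of the listing above (positions 1-120).
block = """\
MGSQLHRQAS TTTYSDLLEG RAPAGYDAGL VDLSFGLRSG SAEEATRTVI VNPALGTTHT  60
PPQVTTRTGR SVPCRTTTAG GTVDGRRPNE EWSATXVVGR KFWDDHRRQS REASTAGITR  120
"""

seq = clean_sequence_block(block)
print(len(seq))            # 120
print(seq.index("X") + 1)  # 96, the 1-based position of the ambiguous residue X
```

Applied to all fifteen rows, the cleaned string should have the 898 residues reported in the record header.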
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      478      483    KSKRQR
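
The Start and End values read as 1-based, inclusive positions in the protein sequence. The sketch below checks that reading against the two sequence rows covering positions 421-540; the helper `subsequence` is hypothetical, and the coordinate convention is inferred from the record rather than documented in it.

```python
def subsequence(segment: str, segment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a segment that
    begins at position segment_start of the full protein."""
    lo = start - segment_start
    hi = end - segment_start + 1
    if lo < 0 or hi > len(segment) or lo >= hi:
        raise ValueError("requested range falls outside the segment")
    return segment[lo:hi]


# Positions 421-540 of GBG71252.1, copied from the sequence listing
# with grouping spaces and position counters removed.
segment = (
    "ILEREVEVQKEHYFKADQANLMMCEALMEIAKAIRKRSKDGGVGDGGETNKGRGHIPKSK"
    "RQRIDDASSSQTEDFLADEVAMVDAQGTTGVVRLGFGRDGVSREQLQAARRSVVGGAAGT"
)

nls = subsequence(segment, 421, 478, 483)
print(nls)  # KSKRQR
```

The extracted residues match the Sequence column, which supports the 1-based, inclusive interpretation.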
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G35640.1    3e-06      Trihelix family protein