PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG69209.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1094aa    MW: 119564 Da    PI: 8.2825
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG69209.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix33.31.2e-10311384268
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikege 68 
                 W+ +e  +L++ +r        ++++++r + k+ +We+++k+m+  g  + ++ C +kw+n+ + ykki++ +
  GBG69209.1 311 WSPEEQIQLVRCKReqemhlaGLDHNYGRMRTKEWKWEDIAKRMANAGRPKDANDCMKKWDNIFQNYKKIQRFQ 384
                 9999999999999955555555556666889**************************************98755 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1094 aa     Download sequence    
MAGMQVHAVP PAAEGGRRPP TTEPSRRYDP SMYNHLSSWE TPLPPSDEEP EGDELPTFPL  60
ASGSTQLLSQ TVLAGGSASN GRGEYTTLLQ QGLEDDDDGG VDLRFGLSSG GGREASHTFI  120
IDVDPSPRYI QQSGSRHTEQ STLCGGTSVC VGVKPSSAGR QHGSSAPSAD RLTSTCPERN  180
GVAVGSLRIG GTLHSMPKST TSTQPDLRDD GKCRPAVRPT PSVENITRGV SNMRAHSDGG  240
NDDGGGGDDA DGRFGEDVEA GDDDDDIPIR PMGKTGRRGR GRSRGAVRGR SAGRGGRGGA  300
NDDGGKSATY WSPEEQIQLV RCKREQEMHL AGLDHNYGRM RTKEWKWEDI AKRMANAGRP  360
KDANDCMKKW DNIFQNYKKI QRFQNASGGP DFFRLSNEER KEHNFKFRME RALYNEIHIG  420
MVGNHTIFPP NIADTGSPDG VHLPRRGAGA GESVGSEGAG EGLPKERSTK RDSDNNAASG  480
AGGGKRKNAR QQALESIADV MDRHGELMSS TIESSSKRQC SIFTRQCDIL EQEVAVQKAH  540
YAASDETQRM MCHALMEIAA AIRGRSATRT LSSSSRGGLI SCVAGVFIVV TSSSCGGQIS  600
CVPRARRRLR PAEERSTASR GYPSSSRLPP AEDRSSACHV HIVVVTPQPH GRPFSRETYF  660
SPGRPHFPLV RSHCQARLQR SPRNMSTRGN TRGKKRNVVV DDSQEQGRGR RQAPKAKRVR  720
TGDALPRVRG RGAQGWAGEA EGACEDEFTA EEEQAKATTS EVRESDRQCS SDRTASKRMQ  780
TPPHEAQLLR GRETRTEKAP VVDLGGDDDE PLERRRIRIR TTTTPPRRWS CGLLQTKGQS  840
RAGCPPRRPT WWRGGSRGRR ACSWRPGGRC RGGGCFRQCG RRSDWEEGRS SCGDGERGGA  900
WQEESQAGDL SMDLPLWFAG ANIEDKPEDD DMAAHQDSTV IRVSYAFRAV VQMGALVDGE  960
FISHDRLSRV ADCFRLLLAA CMWIMRMAGD DPHSHYEAFY FADLVVKPTL VAAMHHSFDH  1020
RRSVVRVAKV VTDRLGKATA TFGQYPNYIP QWVPCDIAFR HDASITGPED ANKLDWLGSG  1080
PPDDDNNDDG KDDA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1867874GGRCRGGG
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.13e-06Trihelix family protein