PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG72975.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 862aa    MW: 93527.1 Da    PI: 7.1296
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG72975.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix36.71.1e-11363434266
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkike 66 
                 W+ +++ aLi a+r+ + +l+       r k ++ +W +v+++++  g+ r+ ++C +kw+nl +++kk+ +
  GBG72975.1 363 WSVEHIIALIWAKRDQDAHLQgmghayaRMKPREWKWNDVARRLKNVGVDRKLEKCGKKWDNLMQQFKKVHH 434
                 **************7766666433333367899************************************976 PP

Sequence ? help Back to Top
Protein Sequence    Length: 862 aa     Download sequence    
MADIFRKLPL LLLAVFLLVV VVVLHVCFIG NAPRRRRRRT SPMKAVRTDS GQGSPRGGGE  60
QVVSGRVARS SSPACRLSLG VGNSSLPPHL QPLPDSSDEE EREGRARTVP LGSGSTQEWS  120
WTELFGGSGG VHGQSCTELL APGLDGEEGH GGSNLSSGLS TGRCESQSRT VIVNPHVGDG  180
GGKLTDVDRS SKSAGAQKWQ GRATSISRST QGRRSFMQSP ALVSAASNLG RRRGIVEEGG  240
GYLDDVVDDH DGRLVWAEER RKIREGREEA IRRGVERLRM DRQAEEVEEP HAGLPSEDDD  300
DDGAGEAGDG NGGYASPSQN SDGGGKGGKT KATSGNGRGP KKAQAKANDG EGDGDPEEKR  360
NSWSVEHIIA LIWAKRDQDA HLQGMGHAYA RMKPREWKWN DVARRLKNVG VDRKLEKCGK  420
KWDNLMQQFK KVHHFQSPSG GIDFFQLNGK ERARHDFNFN MDRAVYDEIE GSSGFNETIY  480
PKNVADTGAR GGVRLPSTTT ADPEAIGDAD ACAGGEDEDE GSTRGSSQTS GSPHGFGKRK  540
STRQQTFEAM TECMDKHGAL MAATMESASK RQCSIQLRQC EALEAEVQVQ KTHYAAFDET  600
PSSKRSEQAV GVASSSQAVA DPSTMRSPAS QPPGGAVQGA SAMADVAKAG DGGAEGEDDE  660
PLVKKLRGQR PEAKVMEVAA RLWTDDIRSI PQKKIEDASE LRAAKERALK VESIAKRAIH  720
GWIFKSDSRH KGYHLAYQYA LNHVATDMAR AMWALEDWRS LVSPMAIRNM LELGMKLPLW  780
FVGANVVDRH QHNECAAYQE AISQSFVRDF TNVVEVAQAM DSGRVSYERL KSLAEAMRYL  840
LAAAAAWIMR MAGDDARSHF DA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
13439RRRRRR
23440RRRRRRT
33541RRRRRRT
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G33550.13e-08Trihelix family protein