PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG85096.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1001aa    MW: 107641 Da    PI: 6.7205
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG85096.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix43.67.6e-14357428266
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkike 66 
                 W+ +++ aLi+a+r+ + +++       r k ++ +W++v+++++  g+ r++++C +kw+nl +++kk+ +
  GBG85096.1 357 WSVEHIIALIRAKRDQDAHMQgmghaytRMKPREWKWQDVAQRLKNVGVDRNAEKCGKKWDNLMQQFKKVHH 428
                 9*************7777777444433366899************************************976 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1001 aa     Download sequence    
MADVFRRLPL LLLVVLLLLL VVLFRVCILG NVPPRRRKTR SSMKDARMEA RQAIPRSGGE  60
QVGGRRCGGS LSTVGRASQR VGYDALPPHL QPLPGSSDEE EEVERRPQTV SLGSASTQEW  120
TATELCGTGG GVYKESFTEL LRPSLGEDEG DGRVNLSFGL STRRSTTPSR TVLVRPHPGE  180
EAGQLTVVDR SARTRALASE TAGANRNSCH LPCLWRRKWR EAVASESTAG PTSWMLVMVA  240
TGGRCGGTCD GITGCGERNT SHGGVERLHV GDRENEKETD DPPAEADDDN DDDDDNDVDC  300
GEGGDGHASP SFQSDMAGKG GKSKPSGRNA RPRAKKGQGK GSGGEGDGDA EEKRNFWSVE  360
HIIALIRAKR DQDAHMQGMG HAYTRMKPRE WKWQDVAQRL KNVGVDRNAE KCGKKWDNLM  420
QQFKKVHHFQ SPSGGADFFQ LSSKERASKG FNFMMDRTVY DEIEGSTGMN HTIHPKNVAD  480
TGASGGVRPP SASYVDPESV ADGEGGAGRE NDEEGSTRGS SQTTGTPDGS GKRKSTRQQT  540
FEALTECMEK HGELLASTME SASKRQCFIQ VRQCEALEAE VEVQRKHYAA SDEVNKLMCH  600
ALLEIAKVIR AFVGSTMSSR GGGRGKAALK QIVEGAAPAK KGRHQAKRQR KVVQAVPAGS  660
ARDVVEEAVV EEEITNDEDD FEDDDDETLQ KKTRVSSAEG VRINEGGEGT PSARRGSGVA  720
AANQPVFVDV ARDVARRDVG GRTKEGAASH ETAQRVLAPV NCPRTPAADM AGSSQAAVEG  780
GTLRSPTVAA RGGAVAVPGE AVEVPKGGDG VAAGEDDEAL VHRLRGQRSA THAMDAAAKL  840
WEDDNRFWNN TAVGTTKAMG GCRVSYERLK GMAEAMRYLL AATMWIMRMA GDDPRSHYDA  900
WVFVQLTAKT TLLASMNRQF DAHRHITQSA QVMTNKLGRP PPTFAPPPVY IPDWASKCGV  960
TFSHDATLAS PMEAKRLDWL GTGPPEDAAA AAEGDDKGEG G
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
13438PRRRK
2330338ARPRAKKGQ
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.15e-09Trihelix family protein