PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG70519.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 772aa    MW: 84389.2 Da    PI: 10.2502
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG70519.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix41.82.8e-1342113266
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkike 66 
                 W+ +++ aLi+a+r+ + +++       r k ++ +W++v+++++  g+ r++++C +kw+nl +++kk+ +
  GBG70519.1  42 WSVEQIIALIRAKRDQDAHMQgmghayaRMKPREWKWQDVAQRLKNVGVYRNAEKCGKKWDNLMQQFKKVHH 113
                 9*************7777777444333367899************************************976 PP

Sequence ? help Back to Top
Protein Sequence    Length: 772 aa     Download sequence    
MAGKGGKSKP SGHNARSRAK KGQGKGSGGE GDSDAEEKHN FWSVEQIIAL IRAKRDQDAH  60
MQGMGHAYAR MKPREWKWQD VAQRLKNVGV YRNAEKCGKK WDNLMQQFKK VHHFQSPSGG  120
ADFFQLTSKE RASRGFNFTM DRAVYDEIEG STGMNHTIHP RNVADTGASG GVRPPSTSYV  180
DSDSVADGEG GAGREDDEEG STRASSQTTG TPDGSGKRKN TRQQTFEALT ECMEKHGELM  240
ASTMESASKR QCSIQVRQCE ALEAEVEVQR KHYAASDEWT AFSPRSCMPF ARCCAVACSR  300
RATLLTATVV VDHVTIYNAR RLPSLSSANR LGAIRRRRPV ITCRRSSSPP LPPSTSIRHS  360
SSSTFQSFVV ALTVIRHPVI RPCTHSRLFH SVFHSRRPAI IQPLRGVLVP KFCEPPTAIE  420
GAFVGSAMSS RGGGRKKAAL KQIVEGATPA KKGRHQAKRQ RKGVQVVAAG SARDVVEEAG  480
VEEEMTNDDD DFEDDDDEPL SRKARVGSAG GIRINEGGEG TPTARCGGGV AAANQPVFVN  540
VARDVARDVA RRDVAGRAKQ GAASHETAQR VLAPVNRPRT PAADVAGGSQ APVEGGTSFW  600
NDTQGSAVVR IIQEARAYLV AVARGVQPPA IRRSISLPHN SIPQHKIEDE SELNAAKERA  660
LKVQTISLKA IHGWVFKSES RKRGYHMAYQ YALNHAATDI ARAMWSAEDW RSLVSPMLFR  720
TTLNADMKLP LWFVGVNIVD RQAMDGGRVS YERLKGMAEA MRYLLTATMW IM
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
11523ARSRAKKGQ
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.13e-11Trihelix family protein