PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG76243.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1947aa    MW: 214302 Da    PI: 8.0453
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG76243.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix31.93.4e-1012071281269
    trihelix    2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegek 69  
                  W+ +e ++L++ +re e +l        r + k+ +W +++k+m+ +g  + +  C +kw+nl + ykki++ ++
  GBG76243.1 1207 WSTDEQMLLVRCKREQEMHLAglghnygRMRTKEWKWVDIAKRMANGGSPKDADDCMKKWDNLFQNYKKIQRFQN 1281
                  9*************66666653333333778**************************************987655 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1947 aa     Download sequence    
MVRGPNARQN GASSSAGDEE VTTAAVLQEM LNRIEAMKTG MNTMNVRLDL PRAKGELQKV  60
QSSREHRTPR PPPQHMRQED PYDALKSSRP KSTYTWLQLR GFNDWSEQGM LELNDDWIYK  120
QTGNHPKTVD VHYGQLKEML IKYGELAQLT YDATTFDSTS RYCGQCTMAP KGLFNLLSAT  180
PEHVLDGLIH IGDNCDDGGD DGGEAASSVN RTITDSESHP SNQSVPARPR SSSTSKWRYR  240
TEYKDVRYLY ATSRADGPLK PLLLLWNYLF PRRAPLFIDA PGDGATLVDT NWFGYVAISD  300
PSPGFGQQGR DLVVAWRGTQ ADLEWAQDFK GWPVPLHKDK AAFGHSKESK QSAQRRKVSV  360
AQGFHQLYTS TLSDLSLHNR GSARCQLHRN IRQLLCKYRD IDSITVVGHS LGGALAVLSA  420
YDIVDSGLNV VAVKLEQARE GDASPLDDEH PETDLGLRVI PVTVFTYAAP RVGNRAFRDA  480
FQALGKRGLK CLRVVNIHDI VPRTPGLLIN ERRSQVVQWL GRFLHVPSYF HVGKVIEVDS  540
FKPPLLTVRK YQGLGVVHNM EVLLHTLDGW RGPNAQFLMT GIRPFVLANK GVDALDPALR  600
VKAAWSTPSN NRIVFDSKHG TGFERYLDGT DGCDRYNPIR EKLIDGTFKP AVGVYSYLRT  660
KSYEWMDSFR RSFRSVVGNT PNRPNPHTGK HPTSMCDHCK KNLASEFGQI QLASPDNLMT  720
NIPERTVTSR EEFFSFEDMD IGWCGSRRRK RVVEHPHKCI VCFDGKSYID DDRTPKERLA  780
TAIDMNHPLT WQTSLRHPSS SSSSSSSSSS YVGYSGSSGF VRGHARRKFS RTSRARSPVN  840
VEFMADLQRR LPLLLLVVVV FLFAVFLHVC LSCRPWKQRR KRRASTKRAV LQEEHPMAGM  900
KARVILLAAE GGRRPATTEP SRRYDPSMYN HLPSWETPLP PSDEEPEGDE HPMFPLANRS  960
TQLLSQMVLV GGSASNERGE YTSLLQQGLG DDDDGGVDLR FGLSFGGARE ASRTVITAAP  1020
ASARGLQQPR REQTGPPTLR GGASVVGGVG SSPAARQHVS GAPSGERLTD NWDVPTGAAA  1080
ASLRTSGARS SMLNRTTAAP PEGRDEGACR PPTGLGASVE NITRGVSNMR AHSDGGDDDG  1140
CGGDDADDGF REEVEGGDDD DDNAVRPVGK TGGRGIGRNN RGGLGRSVGR GGRGGVTDDG  1200
GKSATYWSTD EQMLLVRCKR EQEMHLAGLG HNYGRMRTKE WKWVDIAKRM ANGGSPKDAD  1260
DCMKKWDNLF QNYKKIQRFQ NASGRPDFFK LSNDERKEHN FKFRMERVLY NEIHSDMLGN  1320
HTIFPPNIAD TGSPNGVQLP RRGAGGGESV DSDGGGDGCP EERSSARDSD VNACSVAGGG  1380
KRKNARQQAL ESITDVMDRH GELMSSTIES SSKRQCSISS RQCDIQERQC DILAQEVAVQ  1440
KAHYAASDEA QRMMCHALLE IAAAIRGRLT ISAASMSTRG TVCGTKRDAA ATVGLAQAKG  1500
RRHIPKSKKV RSDEASRNVP ARGFEGWAAT TEMESDDDFG MEEPQAEAMA SAVRQSARQR  1560
ASDHSAPKRM HPPAPEAQQA RGRDARKDKA AVVVAEGDDD ETLHKILQPR AAGGTATVNS  1620
APVATAREEE AVAATAREEA AVAATAREDA RGEKTTDREG GDVGPSRRLP SLVILPKSST  1680
RVARISDPSQ LQQAISRAAK VENVALRVLH GWVFKSGNRA KGYNLAYQYA LESVATDIAR  1740
AMWYAEDWSN VVSAPICGHT IDLNMDLPLW FVGAHIDDRP DDDDMAAYQE STIICIAQAF  1800
RAAVQMGAHI DGDFISYDRL CRVADCFRLL LAATMWIMRM AGDDFRNHYE AFYSVNLLAR  1860
PTLVASMHRS FDHRRSVVWA AKAVTERLGK PNPTFGEHPN YIPQWAPSDI TFGHDTSVTG  1920
PEDCKRLDWL GSGPPDDDGD DDKKEGA
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.15e-07Trihelix family protein