PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG81187.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1063aa    MW: 117397 Da    PI: 8.1717
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG81187.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix43.68e-14577648266
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkike 66 
                 W+ +++ aLi+a+r+ + +++       r k ++ +W++v+++++  g+ r++++C +kw+nl +++kk+ +
  GBG81187.1 577 WSVEHIIALIRAKRDQDAHMQgmghayaRMKPREWKWQDVAQRLKNVGVDRNAEKCGKKWDNLMQQFKKVHH 648
                 9*************7777777444333367899************************************976 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1063 aa     Download sequence    
MECIGCRESW LGGFTANLEL DFSSSSSCSQ RSACRELDGR ILFRSRKLGR IGYGGYGGFC  60
LRRWKLSSSS SLAAMGERSW ENRRRSPARH YMGGRRREWE DGDGGDVRER ERIEWPMHVD  120
RISSVTNHFV RQCVRLRQSR SYRNERGVVL VVGRIPLRKV GPKHSAIQSS HGARTLKTSM  180
PSKLWQDNEH CAPLLCWRPR CLLFAAVVVD CFMADVFRRL LLLILVVLLL LLVLLFRVCI  240
LGNVPPRQRK RRSSMKDARM EARQAIPRSG GEQVGGRRCG GSLSTVGRAS QRVGYDALPP  300
HLQPLPGSSD EEEVVERRPQ TVSLGSGSTQ EWTATELCGT SGDMYEQSFT ELLRPGLGED  360
EGDGRVNLSF GLSTRRSTTP SRTVLVRPHP DDEGGQLTVV DRSARTRALA SETAGTNRNS  420
SMPQPRAASL SKGAQGRGIG VDGGTDFLDV GNGRDGREVW RDLRRDHRLR REEYITRGVE  480
RLHVGDRENE NETDDPPAEA DDDYDDDDDD DNDDDNDVEC GEGGSGHASP SLQSDMAGKG  540
GKSKPSGRNA RPRAKKGQGK GSGGEGDGDA EEKRNFWSVE HIIALIRAKR DQDAHMQGMG  600
HAYARMKPRE WKWQDVAQRL KNVGVDRNAE KCGKKWDNLM QQFKKVHHFQ SPSGGADFFQ  660
LTSKERASRG FNFTMDRAVY DEIEGSTGMN HTIHPKNVAD TGESGGVRPP STSYVDPESV  720
ADGEGGAGRE DDEEGSTRGS LQTTGTPGGS GKRKSTRQQT FEALTECMEK HDELMPSTME  780
SASKRQCSIQ EGATSHETAQ RVLAPLNRPR TPAANVAGSS HAAVEGGTLR SPAVVARGGA  840
VAVSGEAVEV PKGGGGAAAG EDDEALVHRL RGQRAATHAM DAAAKLWEDD NRFWNDTQGS  900
AIVRIIQEAR AYLVAVARRV QPPAIRRSIS LPHNSIPQHK IEDESELNAA KERALKVQTI  960
SLRAIHGWVF KSESWQRGYH LAYQYALNHA ATDIARAMWS AEDWRSLVSP MLFRTTLDVD  1020
MKLPLWFVGV NIVDRHEDDE CGISQAQLAR QKLWTAVECR TSV
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1248253QRKRRS
2550558ARPRAKKGQ
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.16e-09Trihelix family protein