PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG61952.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family B3
Protein Properties Length: 3601aa    MW: 377387 Da    PI: 5.0366
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG61952.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B331.82.6e-1028672944486
                  E-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-S CS
          B3    4 vltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldg 86  
                    ++++vl++ +l++pk f++     +++s ++t+  ++g + ++   +   ++r+ +++GW++F ++++L++gD +v+++ +
  GBG61952.1 2867 CKMTPSVLRN-WLRIPKAFMSLL---PQGSGDVTFIGPNGYRCKLGWLW-SPDKRCGFSSGWRQFIRDQELQDGDWIVLEFLK 2944
                  5678888887.6********433...456779***************65.55568889*********************9654 PP

Sequence ? help Back to Top
Protein Sequence    Length: 3601 aa     Download sequence    
MGHPGGKSGL EEKKGEEDED VGCQKQGVGN GIARSGDHND GEGAVFSKRT EEGVEEAAVE  60
GNAEMARGED SPYTLTGAQG EEAGSKLEYA TNVSDPGMAH DDSRAGSSDT RSSVQTNRNG  120
KLDSEGGHDG DAKDIGKDSS VNSDKPGSNT DNRCAEDPDG HNNAPGLIAD SIPHRNVGNE  180
SANIDLGTPN SSCVMQSGTI SMISKGRDDG VRARVMGMRV DGGFGNEHGL SGDCNSDSCS  240
KPDLVKADAA MGSVLSVISE CCENVKDTST VDGECGNDND TSVMDDNLCR ASNVSGAGCG  300
VQRRGIRLTG ERADGKEAAC HMVEVNADTV MGTVVASLIC ESEDFKATGA LDNNVGTISD  360
ISVADCLVIR EHSDRGGGQV ADCSTGVFSV NGVGTSVLRD DSEGLASGDG ASSGKVDLAG  420
VACIRSLSGE NVASEEREAG SGSVQLAKRI NDKDSVGCCQ TIPDASVALP VSDAGVAPAH  480
GGIPTKGGID ADPTAVGENE ESKSLEDAAD TLLAGMNSRG ASRVTLPNGE SLEQTKGPGL  540
NVAKAALSLQ TIENGKNAPG KAAGAMEVAD ANESEKIREV AAQVRDGADG TATEDNPTDS  600
MEPVSTGQKT GKNEAVLCGK PSQDSAMVCL GDDGDGDGGK EQTAADVAGA VPGDTVEGTV  660
VHGDRFVPDP QQLMDPRVDG RDHACGVDSS GGIPLMAINN KSSTGDETDS LRDNSVNDRN  720
SCGDGSGGGG GGKQFSSCTS SVNDGDQVDT SNVNKTSAGS AGKRGSISYG DVDPKQVPPC  780
NSSDSQMGGD GVSVSQLTSG NRCERGVESG TGSGAAMFLG MAVAGSQEWT VLDVPRVEGL  840
CDNALSADAM MQHGFAMMHD DNAIDSGDGV VGDSDKCTGV EHAHCGDGDA ALGKNGQTQR  900
PVKSGSRGTC LTGSVKGFAV DSAGRGGGSR NSLCVEVARC GDDVQGAIYS ASHVEAASVI  960
HNDGSAGIDS IASISTGYFS DSCRGQGKLP RVHPDDAGNG VCDQRSLGSN EAGWNGVAGC  1020
AQDAGPATGI CSDDGGATFS PNGTVNTGRE ALQEDKATSG HVPENNDAEA RGCSGGEVTN  1080
RGSSVGMVLS LIHIVAECGE GVMKQGPANG DGVMCHNTKA TEQGAAVNGL VSMPDRKRLC  1140
ADRSEQVSLA RGHAEPDAWR HASAAASSSP TQDLQIVNCQ TCVPAAAELA SNADEGAHLE  1200
GSCVTPSIGD SAIADVDEGA TAQGRDVLDE GKSALSAVAV SLDHDVLVMG SPTQALGGEH  1260
ATPFLEGCRP TRNAKGVDEG EVRLTEHGCI GDPNDGVAKG NDDSLLLDSH APLRSSGDNR  1320
VRNRNCMDGA PCNEDGKAQQ CTACEVVRPS VLALPCSQEG ICDPSKSATG FSQPPKRNDG  1380
GLVGINGRHN SVTCDQIVIV RQHRAPGAEE SPEAGTSGWS DTVVTACLNE SGTDFHASKV  1440
KEGELVCAVN CADGAASDHK NHAGNGHSGN DGAAGAARSL LHYTQIDRHH SNLGVHCACK  1500
PGNHKPLNDC EFANDGAGSG NCIGHADCTG GSDADCQRED YAAVSLTGAL VKPTPEGQDV  1560
GDLKVFGNGD TGNLNSDNIG EGDRRPQPCS GGSGTCVAGS YAPDVSSGSG RESDASVAGI  1620
IDCSEVCADD IPMAVDGCRD THVPLVLGGQ GDSVVARLTE GYGRSSADDG KRNEHVRGGG  1680
TPIRYACNNL SGKSEDLCDQ KCNSTGQILG AMTEIAKREE KPSGVGGGQD DDLGDDHVAT  1740
HAGEAQGKSD GGDVGEDAGC GRSSGPRPVV KDDNAGEEPS SGGGGHGDLL RDCLSCHDLV  1800
ISLFENAECN CAANVPTGET CGRGNSHGLE LEFDEAVNRP PVGDGCHVAT LGNGLAKSHA  1860
RDSEKEGSAS RGEACKSSCS SEPAMSVDRP VEDSVLQGGD HEGGIPQDCC IFGDVVTSSS  1920
WDAERKSQNS DIQQNQICKK RMKSNSPEPT VKGDKAVEAP SLVGGTVEYV PSFDPFCISL  1980
GGDPEAKGAV DVHPCCEERG DLVSIIKAEE SIVEPSVGAN GILADDSPGI SPENCGVDVH  2040
TRRECERIVT PVQVEVANTN VELAPLRAGD NVAVLRENPI FDDPATSPSR IGADLDVLPG  2100
CELSTLGAGD YDEAADTKFL FVRVDDRTIN PLGHDGALGE RRAPSLHSCQ SRRTKKHQNS  2160
FGSIEHGGAG SLCLHPDTLE EVPRTGAVAS KDDSDNVSVT THAQTPAIDS ASAATATAAV  2220
MGNVGGANGG QRESDLIFNA GPTCWSVNTD GMCAPYVQPA NRQHDGGRHN HGSGDSGEDA  2280
KACVAGAASE LAYDGVAQES DIPTGGLQRT VHEERSKFCR TGVELISNST GRAFVGENGC  2340
GKISNGNVGS GSESGLEKHV SVNSVGNGIC VQRSRGKGLH VFLSVKDTSA VEGGQKGGLG  2400
QTGGCCSVDA SQRMLSIAPS FSVRLCPSLK DGTIVINSRH GSTVLSQAVE NDNHTISGRD  2460
EKGIGVAADD GDDGVEVRGE CHSSVYPVAR DLVSVERSDT KGHDSEAAEA DQQHELKQQA  2520
ACTVDDLNKT VSIPYKFQLR LYASSKEAQA AAAHCVRSLE RNRPAIWKQI GKNRRSKSPG  2580
AVALHSMPIY RSWFQKSVLG HVEKGHCEVD VVVEDWEGSR WGMTWHHRKS VLLNGWANLM  2640
NFHSLAFGDF IVFETVGPSL LKIYAFCIGD EEDGHGAIQE QDGDGEYRCG KRLLEARKRL  2700
RQAKRRKAGT VTHPYQQSPD SGLHQSSVIN YEHVHLSACN NEGESSGHAM RFISNKGIVA  2760
FKQERGAGAS LVTETHIDTL QGVGDDKAEC RRSNRKRRVK STVAALFQKK MRVEKEREEN  2820
KDDVKEHKDR KLVDQYVTLR RPILQEERKA VMEKVEAFVS DYPTFSCKMT PSVLRNWLRI  2880
PKAFMSLLPQ GSGDVTFIGP NGYRCKLGWL WSPDKRCGFS SGWRQFIRDQ ELQDGDWIVL  2940
EFLKQTVVRV NLFPIVCSSE TCAEGVPLVE TWQAVPRGYR SKDAGGHPLV VRGDNGDTAE  3000
SVMASVGRGS RGWGAADARP GWAADGALEA DPRSRARELD GGMAAEQDVK GRGRGKKQKS  3060
NHNWKISSSN VGSKVSKTEI QKNVGPCKGG RRQGRDEGAS DVGGSSSQKS QQWRDMVEAG  3120
NWCEEYREVE VSLGRAENDK ARTGRGGKFS NLTIQRSQNE LVLRIVGTNC LTDQSSDGIA  3180
KGCTNRRPCG KGKLQASPWT EERILVGDGR ASGKKSRGKV RGAREESTRS VDEGGRNKVN  3240
QVVMQAQLVE NEWVEDSKQN GLARVKNEAA GRRELDNTAP SGFVDVSNAR FGPSAIDGKN  3300
DKKDKKRKRK WRFSSLKTKR RRSLLEKMAK STDEASCGNQ GFVLGGQSVE VTEIAAEHAS  3360
AGGESTRLVR AKKTDQKTDG QRIANMRRRG GEESLPVEER SVLGTGMPSP QVVVKKERCQ  3420
GVVYGSGRKQ ITEGNSDGNL LMSTADRRPR RDAKERDGRP KDQRKQTYAE VRGGEQGPRA  3480
PRKHNHVEEN SSVGKAEGRR SLVRGAVEDR GISGYRDGWI GNGQGDDEEG EYFSVDRMLN  3540
VNWGAGKVLV TLTGLYVSGD SSRYQSGGIW VPKEWLATDF SRVFVPQSWQ LKSHASAIRG  3600
W
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
133023310KKDKKRKRK
233053310KKRKRK
333053322KKRKRKWRFSSLKTKRRR
433063323KKRKRKWRFSSLKTKRRR
533073324KKRKRKWRFSSLKTKRRR
633083325KKRKRKWRFSSLKTKRRR