PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG73989.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family HB-other
Protein Properties Length: 5558aa    MW: 590547 Da    PI: 5.4577
Description HB-other family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG73989.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox536e-1731653216657
                  S--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
    Homeobox    6 tftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57  
                    t+ q e+Le+++++++yp+ + r+eLA++lgL+e+q ++WF  rR k++k
  GBG73989.1 3165 MKTPAQKEVLERYYAEDKYPTDTVRAELATQLGLSEKQLQMWFSHRRRKDRK 3216
                  6799**********************************************97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 5558 aa     Download sequence    
MVWFGAPEVP DHGMVWSPRG AGPLDGPTTQ GRRTVAWSDV PDAPDDWMVR RPRGDEPRPS  60
PRASPCNDPN TEWHVAPDHF PVWYGESGTP DHVAAVADTA ETPDQDLLCP GGSGTPDHAM  120
AGSGLPKPSE LTMAWPGVAE TPDQSMAWSG LRQTTIWHGP DRSVAWSSAP EPPNHATVQF  180
SVIEVADYAT AWSVVPEPPD HAMLRSGVGE VSEHTMARSG APEPADRAVV WSGVVEAADH  240
TSPEPVNRNM QWSGGAEAPD HTTAWSGVPE PPDHGTPRTG VVEAPEHTMA CHGLASLRRQ  300
SIPWHGLESR SRRTMPCYGL ASWRRRTIPV PEHTVAWPGV PTATKHAIAW SVVLTATKGK  360
QRRSLWDSGS KGDFIHPRVV KEARLPRTGS PSPISVTLGD DKTQRFFDQT VTDLPFFLTL  420
EQTDRSPASR RHCSSAHIDV METRYDFILG TPWSRRFRST EADWATNTLV LKTKCGQTYR  480
VPFIGTTATP RPDPPPPEPS VPTPSPSITV TSPRQFAHFI RQDDVTFFMV NVTDLLHYDP  540
PCPDAELLPL EPDPPSISMT LISTFVPPPS VESTLPSRAD ADAEELAQYT ADLELAIRDL  600
IREYHDVFPS SFSLSIPEAT ELKRQLEELL RLGFIKPSNS PWGAPVLFAR KADGTLRLCI  660
DYRSLDRYTV KNSYPMPRFD ELFDRLAGNC FFTKIDLCSG YHQIRVAAAD QPKTAFRSHF  720
GHYEFTVMPF GLTNAPATFK RAMNDILRDI LEQYVLVYLD DILVYSRTLE EHLRHLRDIL  780
DRLRRHGFYA KLSKCRFAQH KVDFLGHYVS HQGLHMDDVK ITAIAEWPAP TSAKQLRSFL  840
GLTSYYSGQS DEERKRGKGR GRREGGGGGG GGGGGGGGLG GGGGGLGGGG GGGGLGGEGG  900
EGGVGVRVRV HGTTVVPLSS LRGSGVVKAP DHWMVRCRRS AGPSTRHVVA AADTAHPTVA  960
AAAAVGADAV AAAAGAPVGS HCWCSCCAPV FPPMVRRRRG ARPSDGPASS RRWTIEASVT  1020
AAADAAHPTV AAAAAVGVAA VAAAAGAHPT LLCTSLPSDG LASSRCQTIE VSVTTAVDAA  1080
HPTVAAGAVA EAVGAHPTVA DAAHPVEASV TAVADAAHPT VAAGAAIRVD AVVTAAGAPC  1140
GSHCWCSWCA PVFPPMVRRR RGARPSDGTA SSTHRTIEAL VTAAVDATYP TVAAVAAAAG  1200
AHPTVADAAH PIELSVKAAA DAAHPTVAAA AAVGVDAVGV DAVGASAAAL VGSHCWCSCC  1260
APVFPSMVRR RQGARPSSGP ASSRRRTIDA FGTADADAAV PIAVAATAVG GVPAIVTSKQ  1320
GFEDMAQLLC PCLPTKGPAS SRRQTIGWSG VVDKPDHRGI EFLRLQQLLV LIPLLLMLLI  1380
LSRRRSPQLL ILLIPLSLQL LRLELMRLQQ PLVFLLGLTP SAATVPQSSL RWSGVLKAPD  1440
LRVVRRCRGA GPSTRPTQQM QTRLFPLWLP QLLSVLFLLL LSHLSKGSST WQSCCAPVFP  1500
PRVRRSQAAR PSDGPASSRR RTMAASVTAD GDAAHPTVAA AAAVGVAAVA AAAGAHPIIV  1560
DSAHPIEASV TAVADAAHPT VATAVAVGVD AVAAAAGAPS GSHCSCSCCA PVFPPMVRRR  1620
QGARPSSGPA SSRRLSIDAS GTANADAVVP IAVAAAAVGA VPVVVVTSEQ GLEYMAQLLS  1680
PCLPFEGPVS SRRQTIGWSG VVGALDHRGV AVAVGVAAVA AAVGAHPTVA DVAHPIEASV  1740
TIVVDAAYPT VAAADGAPVG SSDGPVSSTR RTIEVSVTAA AHTAHPTVAA VAVLGVAAVA  1800
AAAGAYPTIV DAAHPIEASV TVAGDAAHPT VPAVAAVGVD AVAATTGAHP TVVDAAHPIE  1860
VSVTAAADAP HPTVAAVAAV GVDTVAAAAG AHPTVDDAAH PIEASVTAAA DAAHPTVNAD  1920
AAVGVDVVAA SAGFEYMAQL LCPCLPSEGP ASPTHRTIEE SVTAVVDAAH PTVAAAAAVG  1980
VAAVAAAAGA HPTVADAAHP IKASVTAAAD AAHSTVVAAV VVGVYAVAAA AGVPARSHCW  2040
CSCCASLFPP MVRRRQGARP SGGPASSRRR TIDASGTADA DAAVPTAVAA AAVGVVPAVA  2100
VTSEQGFEYM AQLPLSSLRG SGVVKAPDHR MVRGPAAVGV AEVAATVGAH PTVVDAAHPI  2160
EASVTAVVDA AHPTVVAVGV DAVAAAAVAH PIVADAAHPI EESVTTAVDA THPTVVGAVA  2220
VGVDAVAAAA GALIGSHCWC SSCAPIFPPM VWRRQGARPS DGPASSRRRT IDTCHRADVD  2280
AVVPTMVAAA AVGAVPSVVV TSELQHPRCI NAPFDKSKKK DASQMVMLRP FEFLKRVNCY  2340
KVSNHIMTID NNLQLSTPFV PFDCDLGDFA DMVRRHKLQE MGQAVLRTTV SVNWPKAVGK  2400
SCGEPKTSAR MSKSGAASTQ AKSVSKSASL LQPKGVSATR APSHAECVAR VSLTSTGGAT  2460
DHEDDDENTE EDSHFRQWDG GRWPDGNDAK DTEGDDVEQR FRDDAHDEEG SLEDGLAEGD  2520
GSEGQEDGGH EGHGSEEEEN EDASDEDAAK REDSYGGGSE GDEGGADDDD GGNDGSQKRR  2580
EQSSRATSHD GPAEDDDVLH GTQSGGSTRL VAAKSSAAHK WGVHSWKARV VYFDDDHLEG  2640
YAYISTFDNT RETRAIPLLF RMVVTAIRGL WQSMKEPKDD WHQAKTVEER TAQRDQYQEF  2700
VWKVIRMTPD TSLRKRSLAK RFVRNWSNLL RPYMCLAASG RVVWNLVEKF FDKWEKGELP  2760
GQDGVRPVDQ EGKAGEAAGP GPAAKTVKGK KVLVHYLQSN KNSSGFYICI NDPPASAWKV  2820
FGDFTSREKE MALEKILARH IVVTSRRAFV KKQMNMQDLY VEEETEDEEI DLLLSGAHHN  2880
AMREETPSLS TLGGQQAMKE DVGGTVESTR TENVMRGTTP NTGGAREGST SKCGMKEATP  2940
GFVSLVRRVR NDIEGAADGE TQNVVHADKE SEDGPDGMDI EDDPLFHIPD DVSKQLAPGD  3000
NISYAMKEMK AGPHFLDIKY KESHEHEWGH HIAQGTDEAI RERSVHEKEG GNGLVHMECT  3060
GPGTQIVSES TMAEMRADLE GVSTAEHKKV PSTGNRERGE ENEKEEEAIK SRSVINLVDQ  3120
SGYGTVSSSW RRGERGVIPK TLGQGGGRGG GAGVRETGRV HKKMMKTPAQ KEVLERYYAE  3180
DKYPTDTVRA ELATQLGLSE KQLQMWFSHR RRKDRKEECR DDLPMGPTIG VPSRPTSGNF  3240
NCSRPSHPSH TTYANQPQQQ HQIPIQQQQL PPSAAAQQQQ QQQQHYPRQH NHCGGVAPLP  3300
PPLPPTPAAP SLSPFSTHRS PLRASPVGED YPSLVLGSEA RESNLPPPSN NYLAEYEEEA  3360
LEMDPSAGRG RRGAGGGGGG GGGGGGSGGV GNAVMPPPVT PPAAARPRLS SMPRRVSGVA  3420
GIGIGGGGIS AGAGAGVGVG VGVSGVGVGV GAVVGGVGGG SVGGGAGGGA VVGKVGGSLK  3480
RRTSSQQQQP QQMLTDNQQD QAMAGPSPLA RDLEDGAARV AAERAAIAAV EAQMPEPLRP  3540
DGPPLAVEFD PLPLGAFSFI SASGAAASAL GPAISSDSKK RKVLSKKPEG LPNEDHDPRE  3600
RKDAKPAAAV KGVAQALPNN KVVSPVPPLA NSTAPPPAPP LKKVPLSQPP GRGVGGMGPS  3660
LGKALVRTTH QTPSGSLVAP GKGSGLSSLT PPSPSMPPPP LAPPSLPPPP PHLPPSSAHP  3720
STYGSDFVHD PASADQGFPN VAMEHGGYYN SSGYGPGPDH PYSHGAADMQ LSRSPHLREA  3780
QGEEYMSPSS QDLYNGQYRT VQGALHRAGP PGQYGVGGGG ARQLPLPQAS GGTPPGHGLG  3840
VGGVGLGGVD MQGVGAGGRP VGVGRMRGVG VAIGGGAGGG MASMVLGRGR GGGRGRGGRG  3900
GGRGGYGDEG TEDGIISARE MMPVGLSPTH STVALTTVPM GGQAGAAGIQ IDDAAIAAMS  3960
HQAAMEAHAK RQAAAAEKLR QMELQEQIRL ERKRKLEEQR LLKEEEAREK RAKRELDRAV  4020
IARRKLEEQA RREEERLERE KKREQERIVR ERLKEEERVE RERRRELERL EREKQKEVVR  4080
VEKEKKKEEL RLQREEAKVR LAKEKALAKK AAREAADLVE DDYLERQEAA RARGLPYGDD  4140
ADLDNVVLPE RFPPDCVLMK QPLALPPWDE SQELVGHLFM CWSFVTTFAD VLGIWPFTLD  4200
ELAEALHDHD SRLLGEVHLA LLKTICSDLE EAARSAAASV PGGAGVVSTT AAVGHPVLEE  4260
IQAWGFNLTT WHLRLNALTW PEVLRQYALA AGFGPKRKKP APRCEVLDGD EESDGEDALG  4320
LLRSGAAVRA AAQHVLGRKA RMLQAQRRLP GRLTPGTVKY AAYQVLSVVG QEGLTVPDIA  4380
ERIQNSGLRD LRTSRTPEAT IAGALSRDPI FVKVAPSRYC VKACFRRADE DVEKGEDIVK  4440
GEGEGYGDDY EEGDDEGDAE TEVDGDEDGS EDGDVVDEDD GGEERDGSVD YRKMRNNRSL  4500
YMGSLKTAEQ EREVIGNEDA ELEHREFERP GRRLCEEETA ERLTHDEGDL EDELTREGSL  4560
QRNRAEQGRT DECEEAENNR DSVGEGCEVD ETDLGEPWVQ ALMEGEYSEL SVEERMNALL  4620
ALVNAVNEGS LLRSALEDRH EAAAALRRQM WAEGQAERRR QKEAQYAKLQ AIASGERPGT  4680
PGADSDGGGR PMAKGGGGSG DEAAHADDYM HGSGVADTGH GPFEKARARM RADIVERADE  4740
AYAIRMRPLG RDRRFNRYFR LVPCGAGECD KGWGRIFYES FKDSHWKVID TEEAFDALLR  4800
SLDTRGMREA ALHRALLRHE SKIRQAMRNE YGGKTTGAGR GKGGAVVVIK KEDREEEDPR  4860
EKCGGEQEGV GEGWKSGAVK IPPAHEVDRR CAWEKFEELE RWMSDVRTRA SAVLAADEIP  4920
RDAKGVAPDR GTRCDYCQEL VRVKEDRHCP HCHQTFDRAM MPRMLPKVFA EHLRDCEHRM  4980
KTGDPSWVQL GPPFRPPPQL QKLKAELMDV EAAIPVEAVD LNWYKHDRPS WAVQLKRAAT  5040
PKELLQVLAE LENVVQRDWL SDDFKPTWEV LCDINQTQKH DGSVWANDIS AKAGPDVAIL  5100
PWIPHTSSAV ALRIWSFDAA LLYRKDDCLP GMEHEQEAED RGKTANEVEV KRVETKKSKP  5160
ASASGRGAPR TAPMASGGAH HTGASGGGSG RGGGAGGRGG RGGRGGGGGR WRRQGGRGAQ  5220
GGRGGEGIAR GGRGTGRVEE GTRASGRGRG RGGRGGGRGR GGWRRGGRKK AEDGHRIIAT  5280
AYRRDTTRVL GRQGHANHRQ VDPRLRAGNL RSELHRSGDQ SDEEEEADRL DAGGESPDYD  5340
RANDGQDVGE ASEEQQVDDD ERDGEEEEGD DEEEDEEDVL VGHDDLDDEA GSADEDDNEE  5400
DREGGMDDED REDDDDLEEG EEGSEREEEE EEEEMEEEEE EEEEEVDGLE DEGEDHHGMD  5460
EGYGEIGKHE YMEGPEEEED YDENEEDHVG REMADPLEHD EEDDMDDEME DELDGDNIEV  5520
AGGHSGHRNG EYPSPGDRSE GTPSIGDDYE DDDEFDDR
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
132103217RRRKDRKE
235793605KKRKVLSKKPEGLPNEDHDPRERKDAK
338943904RGRGGRGGGRG
438963906RGRGGRGGGRG
551995207GGRGGRGGG
652015210RGGRGGGGGR
752025210GGRGGRGGG
852435253RGRGGRGGGRG
952495259RGRGGRGGGRG
1052515261RGRGGRGGGRG
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G44180.23e-74HB-other family protein