PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cse_sc026227.1_g020.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family B3
Protein Properties Length: 1579 aa    MW: 178738 Da    pI: 4.04
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc026227.1_g020.1 genome Kazusa View CDS
Signature Domain
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 B3 34.3 4.1e-11 21 101 16 95
                            EE--HHH.HTT---..--SEEEEEE.TTS-EEEEEE...EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEE CS
                     B3  16 lvlpkkfaeehggkkeesktltled.esgrsWevkliy.rkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvk 95 
                            l lp +f+++h  +k   k+ +l++   g sW++k++  +  + ++ + +GW++ vk+ gL  g+f++F+++g s f+++v 
  Cse_sc026227.1_g020.1  21 LPLPSNFVKRHLRNK-ILKDPILKSaNGGYSWKLKMKKfDDDDHGFCFVNGWSKVVKDVGLLFGEFLLFRYVGYSVFSMHVY 101
                            789********5454.5678899994556*******655888888***************************9999988875 PP

2 B3 42.8 9.6e-14 209 295 6 92
                            ..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS..-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE. CS
                     B3   6 tpsdvlksgrlvlpkkfaeehggkkeesktltledesg..rsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefe 91 
                            ++s+  ++  l++p + ++  +++++e  t+tl++ +g  +++ v+   ++ks+ry+ t+GWkeF ++n++ egD +vFk++ ++++ 
  Cse_sc026227.1_g020.1 209 MTSSKKNKYKLRFPPELVALTKINNKE--TITLKNDDGyeKQMPVRSDKQNKSKRYYVTEGWKEFQQSNDISEGDECVFKFITSEDKL 294
                            566777777799999999666666444..8999998888899999999999*****************************99765554 PP

                            . CS
                     B3  92 l 92 
                            +
  Cse_sc026227.1_g020.1 295 C 295
                            4 PP

3 B3 42.9 8.8e-14 379 467 5 93
                            -..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS..-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS.SE CS
                     B3   5 ltpsdvlksgrlvlpkkfaeehggkkeesktltledesg..rsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr.se 89 
                             ++s+++++  l +p +f++  g++++e  t+tl++++g  +++ v+l+ + ++ +y+l++GWkeF +  ++ egD +vFk++ + ++
  Cse_sc026227.1_g020.1 379 IMTSSNTNQYKLPFPTDFVALAGINTKE--TITLKNLNGyeKQFPVRLNKQYSRTSYYLGAGWKEFQRNTNISEGDKFVFKFITSeDK 464
                            57889999999********766777544..89999999999*******77777778*************************9976344 PP

                            E..E CS
                     B3  90 felv 93 
                            f +v
  Cse_sc026227.1_g020.1 465 F-CV 467
                            4.44 PP

4 B3 34.6 3.4e-11 1281 1368 9 98
                             HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE....EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE.. CS
                     B3    9 dvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy..rkksgryvltkGWkeFvkangLkegDfvvFkldgrsefel 92  
                             +   +++l+lp +f+   g+++++   +t+++++g + ++ ++   r +s +y l+kGW  F + n++ egD +vFk++ ++++ +
  Cse_sc026227.1_g020.1 1281 N--HKSILRLPPDFVGLAGINTKK--NITVKSLDGDESQMAVRSdrRFYSLQYHLAKGWVAFMQCNNISEGDECVFKYITSEDKMC 1362
                             3..34569*******666777555..677888888777777733224445558899***********************9888877 PP

                             EEEEE- CS
                     B3   93 vvkvfr 98  
                             + k+++
  Cse_sc026227.1_g020.1 1363 LAKIIK 1368
                             777776 PP

5 B3 34.8 3e-11 1387 1472 11 92
                             HHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE........EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE CS
                     B3   11 lksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy......rkksgryvltkGWkeFvkangLkegDfvvFkldgrsef 90  
                             + ++rl+lp +f+   g+++  +k +++++++g++ +++li       +k+  ry l+ GW+ F ++n++ egD +vFk+  ++++
  Cse_sc026227.1_g020.1 1387 GHKSRLWLPTDFVGLAGIDT--KKNIMVKSLDGNETQMTLIAcpwackKKQLTRYCLSMGWSAFKRSNNISEGDECVFKYLTSEDK 1470
                             45678********6667774..458***************88897765555566*************************9876665 PP

                             .. CS
                     B3   91 el 92  
                              +
  Cse_sc026227.1_g020.1 1471 MC 1472
                             44 PP
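
The five hits in the Signature Domain table can be checked programmatically. The following is a minimal sketch, assuming each table row is written as eight whitespace-separated fields in the column order shown in the table header, and assuming Start and End are 1-based inclusive protein coordinates. The names `DomainHit`, `parse_hit`, and `non_overlapping` are hypothetical helpers for illustration and are not part of PlantTFDB.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    number: int
    domain: str
    score: float
    evalue: float
    start: int      # first matched residue in the protein (assumed 1-based)
    end: int        # last matched residue in the protein (assumed inclusive)
    hmm_start: int  # first matched position in the domain model
    hmm_end: int    # last matched position in the domain model

    @property
    def length(self) -> int:
        """Number of protein residues spanned by the hit."""
        return self.end - self.start + 1


def parse_hit(row: str) -> DomainHit:
    """Parse one row: No. Domain Score E-value Start End HMM-Start HMM-End."""
    no, dom, score, evalue, start, end, hmm_start, hmm_end = row.split()
    return DomainHit(int(no), dom, float(score), float(evalue),
                     int(start), int(end), int(hmm_start), int(hmm_end))


def non_overlapping(hits: list[DomainHit]) -> bool:
    """True if no two hits share a protein residue."""
    ordered = sorted(hits, key=lambda h: h.start)
    return all(b.start > a.end for a, b in zip(ordered, ordered[1:]))


# Rows transcribed from the table above.
ROWS = [
    "1 B3 34.3 4.1e-11 21 101 16 95",
    "2 B3 42.8 9.6e-14 209 295 6 92",
    "3 B3 42.9 8.8e-14 379 467 5 93",
    "4 B3 34.6 3.4e-11 1281 1368 9 98",
    "5 B3 34.8 3e-11 1387 1472 11 92",
]

hits = [parse_hit(r) for r in ROWS]
lengths = [h.length for h in hits]
print(lengths, non_overlapping(hits))
```

Keeping the E-value as a float makes it straightforward to filter or rank hits later, for example `sorted(hits, key=lambda h: h.evalue)`.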

Sequence
Protein Sequence    Length: 1579 aa
MASPQPPFFY HILMNPSSPH LPLPSNFVKR HLRNKILKDP ILKSANGGYS WKLKMKKFDD  60
DDHGFCFVNG WSKVVKDVGL LFGEFLLFRY VGYSVFSMHV YSVNGCEKIL VPKSKAPRTN  120
VREVSVDDEV VFDDDDDVED EVEDEEDDDD VEYKARDYVE EDDEDEEDDD VDEDDDDVDF  180
GDDGDEDDGE DEDIDVDVDG NDGDPFFIMT SSKKNKYKLR FPPELVALTK INNKETITLK  240
NDDGYEKQMP VRSDKQNKSK RYYVTEGWKE FQQSNDISEG DECVFKFITS EDKLCLAKVT  300
KKKTRATEVD NDDMDDDDDG MDEEGKVEAA KDDDFIVDGD DIFDDDGMDE EGEGEDVKLV  360
DNDDEDDSVD GDEGDPSFIM TSSNTNQYKL PFPTDFVALA GINTKETITL KNLNGYEKQF  420
PVRLNKQYSR TSYYLGAGWK EFQRNTNISE GDKFVFKFIT SEDKFCVAKI TKKKTRARPL  480
PPAAEAPVTE VDDGIDDKDV DDNDVKDVDE DDELFDDGNP SFVATITPTY NRVLRMPTGF  540
GELAGFDTKE TLTLKGLDGY EKKMPLRVDQ KRYYVGMGWK DILQSTSISE GDKCVFKFIT  600
SEDKLCLAKI IKKETRARPQ LPAAEGQETE FNDDDTEDDE DVEDDNDKDE NEDVERVDDG  660
IDDDDDNDKD ENEDVEPVDV DPFFVVNITT SHKYMLRLPP EFVELAGIDG KKNIIIKCPD  720
GNERQMVLRP VKQGQSTRYC LSMGWLAFKR SNNISEGDEW VFKYIKSEDK MCLEKITKKV  780
DGIDDKDVEV NDSKVDDENV ELDVEVNDSK VDDENVELVD DGMDEKDVNG DDNKDEDENV  840
ELVDDGMDDK DVNGDDNKDE HEKVELVDDG NPFFVVTITP TRKLRLPTDF VALAGIESKE  900
NIIMKSLDEN ETQMSLRTDK RFRLTQYHLS VGWPAFKQSN NISDGDECVF KYITSEDKMC  960
LAKVTKEKTS ARPPPLAVGT PPTEVDDNDL NVDDGMDGKD VVVDDNKDED ENVELVDDGM  1020
DDKDDVVVDD NKDEDENVEL VDDGMDDKDV NGDDNKDEDE NVELVDDGMD DKDVNGDDNK  1080
DEDENVELVD DGLDGKDMVV DDNKDEDENV KLVDDGMDDK DVNGDDNKDE DENVELVDDG  1140
MDDKDVNAKI TKKETRARSP PPATESPPTE VDDDMDDKYA EVDDNKDKDE NVELVDDGMD  1200
DNDVNEYVKL VDGMDDKDTE FDNNNGEDEN ENVELVDDSM DNKDVDEYVK LVDGMDNKDE  1260
DENVEPVDDG DPFFRVTITP NHKSILRLPP DFVGLAGINT KKNITVKSLD GDESQMAVRS  1320
DRRFYSLQYH LAKGWVAFMQ CNNISEGDEC VFKYITSEDK MCLAKIIKTS ATEVDADPYF  1380
VVTMTPGHKS RLWLPTDFVG LAGIDTKKNI MVKSLDGNET QMTLIACPWA CKKKQLTRYC  1440
LSMGWSAFKR SNNISEGDEC VFKYLTSEDK MCLAKVTKEK IQTRLPSPAE VVKRKRGQLP  1500
PVTAIKMTAA ETVKMRSRRP PPVEITITVA EAVKRKRGQP PPSSPRVTKD MKRKKGPQPH  1560
VHSAEGVEVT KKPRGRPAK
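The sequence above is laid out in blocks of ten residues with a running position counter at the end of each full line. A minimal sketch of turning that layout back into one contiguous string follows; it assumes that any purely numeric token is a position counter and everything else is sequence. The function name `join_sequence_block` is a hypothetical helper, and the example uses only the first two lines of the record.

```python
def join_sequence_block(block: str) -> str:
    """Concatenate residue groups, dropping trailing position counters."""
    residues = []
    for line in block.splitlines():
        for token in line.split():
            if token.isdigit():
                # Running position counter such as "60" or "120", not sequence.
                continue
            residues.append(token)
    return "".join(residues)


# First two lines of the protein sequence, copied from the record.
EXAMPLE = """\
MASPQPPFFY HILMNPSSPH LPLPSNFVKR HLRNKILKDP ILKSANGGYS WKLKMKKFDD  60
DDHGFCFVNG WSKVVKDVGL LFGEFLLFRY VGYSVFSMHV YSVNGCEKIL VPKSKAPRTN  120
"""

seq = join_sequence_block(EXAMPLE)
print(len(seq))
# Residues 21 to 35 (1-based), the start of the first B3 hit in the alignment.
print(seq[20:35])
```

Run on the full block, the length of the result can be compared against the stated protein length of 1579 aa as a quick sanity check on the copy.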
Nuclear Localization Signal
NLS
No. Start End Sequence
1 1534 1555 KRKRGQPPPSSPRVTKDMKRKK
2 1535 1556 KRKRGQPPPSSPRVTKDMKRKK
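The Start and End columns can be cross-checked against the protein sequence. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly. To stay short it works on the single sequence line covering residues 1501 to 1560 rather than the whole protein. `slice_1based` is a hypothetical helper name.

```python
def slice_1based(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at protein position fragment_start."""
    fragment_end = fragment_start + len(fragment) - 1
    if start > end or start < fragment_start or end > fragment_end:
        raise ValueError("requested range lies outside the fragment")
    return fragment[start - fragment_start : end - fragment_start + 1]


# Residues 1501-1560 of the protein, copied from the sequence block.
FRAGMENT = "PVTAIKMTAAETVKMRSRRPPPVEITITVAEAVKRKRGQPPPSSPRVTKDMKRKKGPQPH"

nls = slice_1based(FRAGMENT, 1501, 1534, 1555)
print(nls)
```

Under the stated assumption, the slice for positions 1534 to 1555 matches the sequence listed in the first row of the table.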
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT5G60142.1 4e-08 B3 family protein