PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc011023.1_g010.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family B3
Protein Properties Length: 1896aa    MW: 214862 Da    PI: 4.4358
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc011023.1_g010.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B344.42.9e-14250336692
                            ..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTE..EEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE. CS
                     B3   6 tpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgr..yvltkGWkeFvkangLkegDfvvFkldgrsefe 91 
                            ++s+ ++++ l++p kf++   ++++e  t+tle+ +g + e+ ++ +kks++  y+ ++GWkeF ++n++ egD +vF+++ ++++ 
  Cse_sc011023.1_g010.1 250 LTSSKSNKNKLRFPPKFVALARIDTRE--TITLENYDGSEKEISVQSDKKSKStsYYVAAGWKEFQRSNDISEGDKCVFRFITSEDKI 335
                            3567777777********777777655..78888888866666666677776667*************************99766654 PP

                            . CS
                     B3  92 l 92 
                            +
  Cse_sc011023.1_g010.1 336 C 336
                            4 PP

2B339.21.3e-124064831186
                            HHTT-EE--HHH.HTT---..--SEEEEEETTS......-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-S CS
                     B3  11 lksgrlvlpkkfaeehggkkeesktltledesg......rsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldg 86 
                            + ++ l++p +f+++ g++  ++ktl+l++++g       +++ ++    + +ry+l kGWk+Fv+a ++ egD +vFk++ 
  Cse_sc011023.1_g010.1 406 THNHVLRMPTDFVRSAGID--TKKTLMLRSLDGyemampVQFDTQY--GYHVKRYFLLKGWKDFVRASNISEGDKCVFKFIT 483
                            4567899*******99999..4569********7766555555555..777788*************************875 PP

3B336.49.2e-125436281094
                            HHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EE CS
                     B3  10 vlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy...rkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvv 94 
                            +  ++rl+lp +f++  g++++e   +++ +++g + ++ ++    +  s ry l++GW+ F ++n++ +gD +v+k++ ++++ ++ 
  Cse_sc011023.1_g010.1 543 TDHKSRLYLPTDFVRLAGIDTKE--NIIMISLDGSENQMAIRKgkrQPASTRYQLSEGWRAFMHSNNISQGDKCVLKYITSEDKMCLA 628
                            445568********888888655..699999999999999955544555555**************************9877665555 PP

4B3314.4e-10650741798
                            .-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE....EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE.. CS
                     B3   7 psdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy..rkksgryvltkGWkeFvkangLkegDfvvFkldgrsefel 92 
                            + +   +++l+lp +f++   ++++e   +++++++g++ ++ ++   r ks +y l+ GW  F + n++ +gD +vFk++ ++++ +
  Cse_sc011023.1_g010.1 650 TISIAHKHMLRLPPDFVALARIDTKE--NIIMKSLDGNESQMAVRPdkRFKSAQYNLSLGWVAFKQINNISQGDECVFKYITSEDKMC 735
                            556677899*******9766766555..69999****99999996633445555***************************9999999 PP

                            EEEEE- CS
                     B3  93 vvkvfr 98 
                            +vk+ +
  Cse_sc011023.1_g010.1 736 LVKITK 741
                            999876 PP

5B3314.5e-1092710011589
                             -EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSE CS
                     B3   15 rlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrse 89  
                              l+++ +f++   ++++e++ l+++d   ++++v    r+ks +y+ ++GWkeF ++n++ +gD  vFk++ +++
  Cse_sc011023.1_g010.1  927 KLRFSPDFVALARIDTRETIILKINDGCEKEMTVHSDKRRKSTSYYVATGWKEFQQSNDISQGDKYVFKFIISED 1001
                             46777778766677777766666666666888888877999999*************************875444 PP

6B337.93.2e-12130313801390
                             TT-EE--HHH.HTT---..--SEEEEEETTS..-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE CS
                     B3   13 sgrlvlpkkfaeehggkkeesktltledesg..rsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsef 90  
                             + rl++p +f+++ g++++  ktlt+  ++g  +++ v++  ++ s+ry+ +k W+eF + n + egD +vFk++ ++++
  Cse_sc011023.1_g010.1 1303 KFRLYMPVDFVRAAGIDTK--KTLTILGLDGyeKEMPVRYDKKGESKRYYVAKEWREFKRNNAVLEGDKCVFKFITSDDK 1380
                             679*********9999954..578888888877999999977788888*************************9874433 PP

7B339.31.2e-1215651671193
                             EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS......-EEEEEE..EEETTEEEE-............TTHHHH CS
                     B3    1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesg......rsWevkliyrkksgryvlt............kGWkeF 68  
                             ff v++++++++++ l +p +f+++ g+   ++kt+tl++++g       +++ k+    + +ry+l+            +GW +F
  Cse_sc011023.1_g010.1 1565 FFVVTITPHNFNHCSLLMPVDFVRSVGIA--TKKTITLKSLDGyekeifVRFDTKY--HHRAKRYFLGkgwmdfarntniSGWMDF 1646
                             89999*********************977..5569********9988777777777..888889********************** PP

                             HHHHT--TT-EEEEEE-SSSEE..E CS
                     B3   69 vkangLkegDfvvFkldgrsefelv 93  
                             ++  ++ +gD +vFk++ ++++ ++
  Cse_sc011023.1_g010.1 1647 ARNTNISVGDKCVFKFITSEDKVCL 1671
                             ***************9987666555 PP

8B334.24.4e-1117161800892
                             -HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE....EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE. CS
                     B3    8 sdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy..rkksgryvltkGWkeFvkangLkegDfvvFkldgrsefe 91  
                               +  +++l+lp +f++   +++ee+  +++++++g++  + l+   +++s++y l+ GW  F ++n + egD +vFk+  ++++ 
  Cse_sc011023.1_g010.1 1716 ITTIHKSMLRLPSDFVTLSRIDTEEN--IIIKTLDGNERLIALRScqQNNSSHYCLSMGWPAFMRSNSISEGDECVFKYLTSENKM 1799
                             55667789********8888888786..6666666666655554444999999*************************98755554 PP

                             . CS
                     B3   92 l 92  
                             +
  Cse_sc011023.1_g010.1 1800 C 1800
                             4 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1896 aa     Download sequence    
MADQRPPSFF HILMNPSSPH LPLPPKFVTK HLGDKILKDP LIKSANGGYL WKLKMKKIDD  60
HSYGLVDGWC NVVKDVGLLF GEFLLFRYVG SSVFLMHVYS VNGCEKFLVP KIKVPSFKVV  120
DYGCIDDDDV DDEVEEVDDV VYDVDDDGDD DDDHHHHVED EDDVEFVNGG DKDDGMEEDD  180
DVEFVNDGDK DDGMEEDDDV LYDVDDDDDD DDDDDDDDVE DEDDIEFVYD SDEGDDVDVD  240
GDDGDPFFIL TSSKSNKNKL RFPPKFVALA RIDTRETITL ENYDGSEKEI SVQSDKKSKS  300
TSYYVAAGWK EFQRSNDISE GDKCVFRFIT SEDKICLAKI TKKKTPARPL PPAAEAPVTE  360
VDTDDVGDDD DIDDMDVEDD NDKDEDEDFE LVDEDDPFFV VTITPTHNHV LRMPTDFVRS  420
AGIDTKKTLM LRSLDGYEMA MPVQFDTQYG YHVKRYFLLK GWKDFVRASN ISEGDKCVFK  480
FITGADKLCL ANITKPPPAA KPLATEVNDD DDKDENENEY ENENADVEPV DDVDPFFVAT  540
ITTDHKSRLY LPTDFVRLAG IDTKENIIMI SLDGSENQMA IRKGKRQPAS TRYQLSEGWR  600
AFMHSNNISQ GDKCVLKYIT SEDKMCLAKI NKAPATKVDV DDGDPSFVVT ISIAHKHMLR  660
LPPDFVALAR IDTKENIIMK SLDGNESQMA VRPDKRFKSA QYNLSLGWVA FKQINNISQG  720
DECVFKYITS EDKMCLVKIT KASATEVDDG MDVEDDNDTN DDTDPFFVVT IIPSHKSILL  780
LLTDFVALAR IGTKKNIIIK SLDGNETQMA LQSYELRWST RYCFSVGWMA FNRSNNISDG  840
DECVFKYIRS EDKMCLAKIT KKITRATELP AEVDNDDMDD EDVDEEGKGE DAKLVDEDDD  900
NDNEEDAYDY DGDPFFILTI SKANQYKLRF SPDFVALARI DTRETIILKI NDGCEKEMTV  960
HSDKRRKSTS YYVATGWKEF QQSNDISQGD KYVFKFIISE DKICLAKITE RKTPARSLPP  1020
AAEAPVTEVD ADDVEDDDIM DDKDTEDGND EDEDEDVELV DEDDPFFVKT ITARNLYLPI  1080
DFVRLAGIDT KENVIMISLD GNECQMGVGK DTRKPSAKYH LSKGVFAFMR SNNISQGDKC  1140
VFKYIASEDK VCLAKIIKAQ AKKVDVDDKD PSFVVTITTT HKTMLRFLSD FVRLAGIDKK  1200
KTIVFKNLNG YEKEMAVQRV NQFRSTSYSV ASGLKEFLRD TDISEGDICV FKFIRSEDKI  1260
CLAKITKKKT PARPLPPAIE PPVTEVERVD DGDPFFVATI IHKFRLYMPV DFVRAAGIDT  1320
KKTLTILGLD GYEKEMPVRY DKKGESKRYY VAKEWREFKR NNAVLEGDKC VFKFITSDDK  1380
LCLAKIKKKK TPTSCPLPPA TDDDDDDDDD EDENEDVDDE DPFFVVTINH KFMLLLPSDF  1440
VALAKIDIKE NITIKSLDGN ESQMAIQSYQ CRLIRYHFSI GWPAFRRSNN ISEGDECEFK  1500
YIRSEDKMCL VKVTKRKTPA RSLPPAAVDT ETKVDADGVE DGEGIADKDE DEDEAAELVE  1560
DDDPFFVVTI TPHNFNHCSL LMPVDFVRSV GIATKKTITL KSLDGYEKEI FVRFDTKYHH  1620
RAKRYFLGKG WMDFARNTNI SGWMDFARNT NISVGDKCVF KFITSEDKVC LAKITKARPT  1680
QVDDDDDVEV DIGMDDKDAY KDVELVDGDP FFVVTITTIH KSMLRLPSDF VTLSRIDTEE  1740
NIIIKTLDGN ERLIALRSCQ QNNSSHYCLS MGWPAFMRSN SISEGDECVF KYLTSENKMC  1800
LAKVTKTKTQ MSSPAKVVKR KKRGRPAKVV KRKRGRPAKS MMAENIKIKA AEFVKRKRGR  1860
PPASPRVTED VKRKKGSEPC GHSGDVEGVD TCVEAS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
118191834KRKKRGRPAKVVKRKR
218191849KRKKRGRPAKVVKRKRGRPAKSMMAENIKIK
318201835KRKKRGRPAKVVKRKR
418201850KRKKRGRPAKVVKRKRGRPAKSMMAENIKIK
518311861KRKKRGRPAKVVKRKRGRPAKSMMAENIKIK
618551870KRKKRGRPAKVVKRKR
718561871KRKKRGRPAKVVKRKR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G26680.14e-13B3 family protein