PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc029611.1_g020.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family MYB
Protein Properties Length: 1045aa    MW: 120995 Da    PI: 7.2068
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc029611.1_g020.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding27.95.3e-092068245
                           SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHT.....TTS-HHHHHHHHH CS
        Myb_DNA-binding  2 grWTteEdellvdavkqlGggtWktIartmg.....kgRtlkqcksrwq 45
                           ++WTteE+  l+ +vk++G+g W+ I           +Rt++ ++ +w+
  Cse_sc029611.1_g020.1 20 QKWTTEEEVALLAGVKKHGSGKWNMIRFDPEfasslINRTDNALRVKWH 68
                           79*******************9999955555555665899999999997 PP

2Myb_DNA-binding38.42.9e-12393443247
                            SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHT.....TTS-HHHHHHHHHHH CS
        Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmg.....kgRtlkqcksrwqky 47 
                            ++WTt E+e l+dav+++G+g Wk + +  +      +R+  ++k++w+++
  Cse_sc029611.1_g020.1 393 QKWTTKEEEALYDAVAKHGSGKWKIVLSDPQfasslVNRSNTDLKNKWRTL 443
                            69**********************************************985 PP

3Myb_DNA-binding27.95.3e-09454504247
                            SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHT.....TTS-HHHHHHHHHHH CS
        Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmg.....kgRtlkqcksrwqky 47 
                            ++WT eE+e l  +v+++G g W  I +  +       R+  ++k++w+++
  Cse_sc029611.1_g020.1 454 LKWTNEEEEALRAGVEKHGVGKWVDILSDSQfasslDSRSNMNLKDKWRTL 504
                            69**********************************************985 PP

4Myb_DNA-binding27.95.3e-09515565247
                            SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHT.....TTS-HHHHHHHHHHH CS
        Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmg.....kgRtlkqcksrwqky 47 
                            ++WT eE+e l  +v+++G g W  I +  +       R+  ++k++w+++
  Cse_sc029611.1_g020.1 515 LKWTNEEEEALRAGVEKHGVGKWVDILSDSQfasslDSRSNMNLKDKWRTL 565
                            69**********************************************985 PP

5Myb_DNA-binding29.81.4e-09576626247
                            SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHT.....TTS-HHHHHHHHHHH CS
        Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmg.....kgRtlkqcksrwqky 47 
                            ++WTteE+e l  +v+++G g W  I +  +       R+  ++k++w+++
  Cse_sc029611.1_g020.1 576 LKWTTEEEEALRAGVEKHGVGKWMDILSDSQfasslYSRSNIMLKDKWRSL 626
                            69**********************************9***********976 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1045 aa     Download sequence    
MKKVGPNNML LRGSMSTQSQ KWTTEEEVAL LAGVKKHGSG KWNMIRFDPE FASSLINRTD  60
NALRVKWHRS LSKSLDVSNN INKVSTCRID DLMHLPPHPT KKRSINIRKK RVVEDFQGSV  120
AEEVSDSLSD LVEVVSDFQG GVVEEVSDSQ GGVVEELSDS QSDVDEDVSE SQVCLSEELT  180
GSQDYYHPGT LTRQKRGNCN NMLNQTRKLS SEKIPRYLEK IWLSFPEDKI SSFIHLDPLW  240
YNMYSNDSNK EKVLNWIKKK DIFSKKYVFF PIVQWRHWSV LIFCHFGESL ESKVKTPCIL  300
LLDSLEKADH SKQLEPLIRK FVLDIYMNLE RTEDAKLSRK MPFLVPKVPQ QRDGEECGFF  360
VLYYIKLFVE SAPESFSISD GYPYFERMAN EIQKWTTKEE EALYDAVAKH GSGKWKIVLS  420
DPQFASSLVN RSNTDLKNKW RTLSKKFDVP KSSLKWTNEE EEALRAGVEK HGVGKWVDIL  480
SDSQFASSLD SRSNMNLKDK WRTLSKKFDV PKSSLKWTNE EEEALRAGVE KHGVGKWVDI  540
LSDSQFASSL DSRSNMNLKD KWRTLSKKFD VPESSLKWTT EEEEALRAGV EKHGVGKWMD  600
ILSDSQFASS LYSRSNIMLK DKWRSLTKSS DVSNNRKKVR THRIHDQMHP RPHPTRKSKR  660
KRKKRVVEEV SDSQDHSVEK VRGSSTNKRS KKRVVEEVSD SHDYSAEEVS DSHDYAAEEV  720
SDSHDYSVEA VSDSHDYSVE KVRDTLTKKR SKKRVVEEVS DSHDYSVEEV SDSRDCSVEE  780
VRDSHDYSVE EVSDSHNYSV EEVSNSHDYP VDNASEIYPF EEVRDSQVYH HQCTSTRQRR  840
GTRSNMVNHA RKLNSEKVHS YLEKLWLSFS EDKKSSLAHL DPIWYNMYST GSNKEKVLKW  900
IKKKDIFSRK YVGHWSVLIF CHFGESLESK VNTPCILLLD SLEKTDHSTQ FEPLIRKFVL  960
DIYGKLKRTE DKRLLRKMPF LVPKVPQQRD GEECGYFVLY YIKLFVESAP ESFSISDGYP  1020
YFMTKDWFSS EEVDSFCKTL NSSDV
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1656664RKSKRKRKK
2659665KRKRKKR
3660664RKRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G17520.13e-19MYB_related family protein