PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc014719.1_g020.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family TALE
Protein Properties Length: 409aa    MW: 46085 Da    PI: 6.3572
Description TALE family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc014719.1_g020.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox22.71.6e-073543872154
                            HSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHH CS
               Homeobox  21 knrypsaeereeLAkklgLterqVkvWFqNrRak 54 
                            k +yp++e++++L +++gL+ +q+ +WF N+R +
  Cse_sc014719.1_g020.1 354 KWPYPTEEDKARLVQETGLQLKQINNWFINQRKR 387
                            569*****************************87 PP

2ELK28.92.6e-10308329122
                    ELK   1 ELKhqLlrKYsgyLgsLkqEFs 22 
                            ELKh+L+++Y+++++++++E++
  Cse_sc014719.1_g020.1 308 ELKHELKQGYKEKIVDIREEIL 329
                            9*******************97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 409 aa     Download sequence    
MAYHPPNDIS QDMPLEQPPF SNNSTGKDHW LNSAILRQQA GHNNSNIFLN LQTNNNNLDS  60
ASSQHHLHNN NNNNQWLSRP ILHRNISNVT GSDRNDFNDH SRNLAAGQVV VDHHNVVGGG  120
EPEVVAGDGE GGGGALMGWQ NARQKAEVLS HPLYEQLLAA HVACLRIATP VDQLPRIDAQ  180
LAQSQQVVSK YSSLGDLDHD HDHDDKELDQ FMTHYVLLLC SFKEQLQQHV RVHAMEAVMA  240
CWEIEQSLQS LTGVSPGEGT GATMSDDDED QVDSDANLFD GSLDAHDSMG FGLPTESERS  300
LMERVRQELK HELKQGYKEK IVDIREEILR KRRAGKLPGD TTSVLKSWWQ SHAKWPYPTE  360
EDKARLVQET GLQLKQINNW FINQRKRNWH SNPSSSTVLK SKRKRLVGR
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1385405RKRNWHSNPSSSTVLKSKRKR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G25220.21e-161TALE family protein