PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc024009.1_g010.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family Trihelix
Protein Properties Length: 445aa    MW: 51425.1 Da    PI: 6.8211
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc024009.1_g010.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix75.96.4e-24134231186
               trihelix   1 rWtkqevlaLiearremeerlrrgk...........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsess 76 
                            +W++ +v++Li+a+++++e+ + +            +kk++W++vsk+m+erg  +sp+qC++k+++lnkryk+++e+ +++ ++e++
  Cse_sc024009.1_g010.1 134 KWSDPMVRLLITAVSYIGEDASMEHgggarrkyanlQKKGKWKSVSKVMAERGHFVSPQQCEDKFNDLNKRYKRLNEILGRGtSCEVV 221
                            7**************888887754334555677888**********************************************669*** PP

               trihelix  77 stcpyfdqle 86 
                            ++ +++d ++
  Cse_sc024009.1_g010.1 222 ENPSLLDLMD 231
                            ***9999997 PP

Sequence ? help Back to Top
Protein Sequence    Length: 445 aa     Download sequence    
MEGNLSSGNM ISGSGGTFDL QGNMRLHHQQ LNSNISNSPF SMHQPHHHNP NAQSARQQGS  60
MVHASIHESF PLRMGVMQDC TRHNQTVALS DLGKGDRGKN SVSDEDEPSF TEEGHDNQNN  120
ENRGKNMSTW HRVKWSDPMV RLLITAVSYI GEDASMEHGG GARRKYANLQ KKGKWKSVSK  180
VMAERGHFVS PQQCEDKFND LNKRYKRLNE ILGRGTSCEV VENPSLLDLM DHVSDKAKDE  240
VRKILSSKHL YYEEMCSYHN GNRLHLPPDP ELQRSLRLAL RARDDHDNDG RRGSHDDLDD  300
DDHDPYMDDH DEYEDHHHHH HTQHHGLNLD QRGVFGVGDG SSKRLKQNEG SQAKNVQADM  360
NQEGVKANHL QDQWMKHRLI QLDEQKIQIQ AQMLELEKER LKWKRYCRKK DTELEMMRME  420
NERMKLENDQ MALELKRKEM STDLS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1398410KERLKWKRYCRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21200.11e-102Trihelix family protein