PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc034211.1_g010.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family C2H2
Protein Properties Length: 494aa    MW: 55170.5 Da    PI: 8.9434
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc034211.1_g010.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H214.10.0001492112323
                            ETTTTEEESSHHHHHHHHHHT CS
                zf-C2H2   3 CpdCgksFsrksnLkrHirtH 23 
                            C++C+k F r  nL+ H r H
  Cse_sc034211.1_g010.1  92 CEICNKGFQRDQNLQLHRRGH 112
                            *******************88 PP

2zf-C2H211.90.0007167189123
                            EEETTTTEEESSHHHHHHHHHHT CS
                zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                            +kC +C+k++  +s+ k H + +
  Cse_sc034211.1_g010.1 167 WKCDKCSKRYAVQSDWKAHSKIC 189
                            58*****************9876 PP

Sequence ? help Back to Top
Protein Sequence    Length: 494 aa     Download sequence    
MKGIFLDDNM SNLTSASNEA SLSSSGNRNE IGTMYPPPHM HQSFGSVPIT TSTNNQTQSN  60
KKKRNLPGNP DPEAEVVALS PKSLMATNRF LCEICNKGFQ RDQNLQLHRR GHNLPWKLKQ  120
KSKLEVVRKK VYVCPEPSCV HHEPSRALGD LTGIKKHFSR KHGEKKWKCD KCSKRYAVQS  180
DWKAHSKICG TREYRCDCGT LFSRRDSFIT HRAFCDALAE ETARSSSSSL NHLPLHLPIN  240
FPLKSEPQHL FQNPQLSFNI GSNSPHPNHQ LPSWLDHNQH HQQQQNSSPN SLHLPSPSSH  300
MSATALLQKA AQMGVTSSNP AISMPHQHQQ SILLNGSQQN LQQDHMCAPL LSSQHHHHQQ  360
QHLSNLSSSD NNHHVILQQN NAFANVTSSC MDQLLHHTTP MTSTNVGSTN IHDIFNGMLN  420
STKQAQASSE TVRKEGGGGG GGANDELTRD FLGLRGFPNP NDHQHFLNMA SLDHMNHQLN  480
SNPNQNQNQI PWQG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1117130KLKQKSKLEVVRKK
2119132KLKQKSKLEVVRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G50700.11e-107C2H2 family protein