PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc000117.1_g010.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family CPP
Protein Properties Length: 769aa    MW: 83296.9 Da    PI: 5.7828
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc000117.1_g010.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR50.73.4e-16483521240
                    TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                             +k+CnCkkskClk+YCeCfaag++C e C+C+dC+Nk 
  Cse_sc000117.1_g010.1 483 ACKRCNCKKSKCLKLYCECFAAGVYCVEPCSCHDCFNKP 521
                            589**********************************96 PP

2TCR51.52.1e-16566605140
                    TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                            ++k+gCnCkks ClkkYCeC++ g+ Cs +C+Ce+CkN+ 
  Cse_sc000117.1_g010.1 566 RHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNTF 605
                            589***********************************85 PP

Sequence ? help Back to Top
Protein Sequence    Length: 769 aa     Download sequence    
MKMISVVNGE NDEESEKGIV VMDTPERTQI ANPLSKFEFG VKIYEDRFAV SLQDSPVFNY  60
INSLSPIKPV KSMHFTQTFS SLSFASLPSV FTSPHVSSLR DSKFLRRHQF LDPLKPELTS  120
EDAKKAETIE GNLNTDQNSS KQQTDFIPND SNNETSAVPL NGCSNFESSK LSYDPVSLNT  180
MNTLKMVGPS ASSASFINNV SADGSLRIEG EIEGIDALHH GKEATGCDWD SMISDASELL  240
NFNSPSEMVP YKGSGQNTLD CTDYKVDGTD NNPSEKNDGM NGAPDNHQTA DSTLNNFEVG  300
EPVEDADEGG PNVFRGMRRR CLVFEMSGSR RKHFEDSSNG SSLNTSESNQ TVLPNDNNLP  360
PLSAGNDSSR CILPGIGLHL NAIASNLVDQ KVKHESSGST RQLIIGHPSG IYGSMALATL  420
DAPSLENDMG PAQNSLLVAE DASKALNFVI SEELSQSQTS PKKKRQVTYI PVVRRSDAGD  480
GEACKRCNCK KSKCLKLYCE CFAAGVYCVE PCSCHDCFNK PIHEDTVLAT RKQIESRNPL  540
AFAPKVIKTA DPMQEDELNN TPASARHKRG CNCKKSGCLK KYCECYQGGV GCSINCRCEG  600
CKNTFGRKDG SEMDLEVNPA DESEGIGSDG SLQMVLHSET EHISATPISA TPASPSRFGR  660
QSIALVKSSK GKPPRSFLAI KGSSSSQRFG TLNPFKGVEN QQLQTVGENE IPEILEGNNE  720
SPISGVKSCS PNSKRVSPPH SGGMGQRSSR KLILQSIPSF PCLTPNSKQ
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1462493KKKRQVTYIPVVRRSDAGDGEACKRCNCKKSK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22780.11e-141CPP family protein