PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Bobra.0046s0021.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Elliptochloris clade; Botryococcus
Family CPP
Protein Properties Length: 568 aa    MW: 60045.2 Da    pI: 8.3547
Description CPP family protein
Gene Model
Gene Model ID        Type    Source     Coding Sequence
Bobra.0046s0021.1.p  genome  Phytozome  View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     43.9   4.8e-14  128    165  3          41
                  TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                          kk+CnCk skClk+YCeCfa+gk+C++ C+C +C+N +e
  Bobra.0046s0021.1.p 128 KKHCNCKASKCLKLYCECFASGKYCDS-CNCVQCNNIKE 165
                          79*************************.********876 PP

2    TCR     52.7   8.2e-17  199    238  1          40
                  TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                          +++kgCnCkks+ClkkYCeCf+ag +C+ +CkC++CkN e
  Bobra.0046s0021.1.p 199 RHNKGCNCKKSNCLKKYCECFQAGIFCATTCKCQECKNYE 238
                          589***********************************87 PP
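
The coordinates in the domain table can be cross-checked against the alignment rows: once gap characters are removed, the query row should contain exactly End - Start + 1 residues. The following is a minimal sketch of that check, not part of the database record; the helper name `check_alignment_row` is hypothetical, and the rows are copied from the two alignments above.

```python
def check_alignment_row(row: str) -> bool:
    """Return True if the ungapped query length matches the row's coordinates.

    Expects a row of the form: <sequence id> <start> <aligned residues> <end>.
    """
    name, start, aligned, end = row.split()
    residues = aligned.replace("-", "")  # '-' marks an alignment gap, not a residue
    return len(residues) == int(end) - int(start) + 1


# Query rows copied from the two TCR domain alignments in this record.
ROWS = [
    "Bobra.0046s0021.1.p 128 KKHCNCKASKCLKLYCECFASGKYCDS-CNCVQCNNIKE 165",
    "Bobra.0046s0021.1.p 199 RHNKGCNCKKSNCLKKYCECFQAGIFCATTCKCQECKNYE 238",
]

for row in ROWS:
    print(row.split()[1], check_alignment_row(row))
```

The first alignment spans 39 columns but only 38 residues (128 to 165), because the query row contains one gap relative to the HMM.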

Sequence
Protein Sequence    Length: 568 aa
MNSSARLADD EQAERAPSTT ASREPYQFQN GAALSVSTTS AFTALNPTNP PPGHSDQGHL  60
YQGPGSGAPP MQPKTGRPGL LSPANQTTSL HPTLQQTVAP SVTTGSRQRS AAPAARRLNT  120
GVSSAQVKKH CNCKASKCLK LYCECFASGK YCDSCNCVQC NNIKEYDDLR SSAMNTILER  180
NPNAFRPKIH GGLEDDSCRH NKGCNCKKSN CLKKYCECFQ AGIFCATTCK CQECKNYEGS  240
DAREVVLRVG ASVAAAAMKR AESSGLTAQD MLDARQIANT GGTGVTVSTG TPSPHQPEKR  300
RRLNAPPVVS SSQAAVATMQ STVPVGLPAA AQVQQTPTIA QGPVAVPAYQ AAVPGAEFQP  360
AISDHKCRLK NGLLSMLHGQ HGLEGLVTVL LNAVERGTHD AANTNFGQGE QLYSQQHEQK  420
ERRLLHEDLK LLMKAWDSIQ ATMATHGPSF NAMYQQATQP HYGNGQVQQY SAGHDVAPVD  480
QSVMHYLQRA TQQNGQQSNG HLQMPQQNGQ GALFAPPPVQ IGAPPAQNAL QPHGPHGHAK  540
PGPPSSGVLA ACNPTDGCAN GYACGSS*
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    299    303  KRRRL
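
The Start and End values in the domain and NLS tables are 1-based, inclusive positions in the protein sequence. As a rough sketch of how they map onto the sequence block, the code below strips the spacing and running position counts from the block and slices out each region. The helper names `parse_sequence_block` and `region` are hypothetical and are not part of any PlantTFDB tooling.

```python
import re

# Sequence block copied from this record, including the running position counts.
RAW_BLOCK = """
MNSSARLADD EQAERAPSTT ASREPYQFQN GAALSVSTTS AFTALNPTNP PPGHSDQGHL  60
YQGPGSGAPP MQPKTGRPGL LSPANQTTSL HPTLQQTVAP SVTTGSRQRS AAPAARRLNT  120
GVSSAQVKKH CNCKASKCLK LYCECFASGK YCDSCNCVQC NNIKEYDDLR SSAMNTILER  180
NPNAFRPKIH GGLEDDSCRH NKGCNCKKSN CLKKYCECFQ AGIFCATTCK CQECKNYEGS  240
DAREVVLRVG ASVAAAAMKR AESSGLTAQD MLDARQIANT GGTGVTVSTG TPSPHQPEKR  300
RRLNAPPVVS SSQAAVATMQ STVPVGLPAA AQVQQTPTIA QGPVAVPAYQ AAVPGAEFQP  360
AISDHKCRLK NGLLSMLHGQ HGLEGLVTVL LNAVERGTHD AANTNFGQGE QLYSQQHEQK  420
ERRLLHEDLK LLMKAWDSIQ ATMATHGPSF NAMYQQATQP HYGNGQVQQY SAGHDVAPVD  480
QSVMHYLQRA TQQNGQQSNG HLQMPQQNGQ GALFAPPPVQ IGAPPAQNAL QPHGPHGHAK  540
PGPPSSGVLA ACNPTDGCAN GYACGSS*
"""


def parse_sequence_block(raw: str) -> str:
    """Drop whitespace and position counts; keep residue letters and '*'."""
    return re.sub(r"[^A-Z*]", "", raw)


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


SEQ_WITH_STOP = parse_sequence_block(RAW_BLOCK)
SEQ = SEQ_WITH_STOP.rstrip("*")  # residues only, without the trailing '*'

print(len(SEQ_WITH_STOP), len(SEQ))
print(region(SEQ, 128, 165))  # TCR domain 1
print(region(SEQ, 199, 238))  # TCR domain 2
print(region(SEQ, 299, 303))  # NLS
```

For this record the three slices match the ungapped query rows of the two TCR alignments and the NLS sequence KRRRL. The block holds 567 residue letters plus a trailing `*`, so the stated length of 568 appears to count the `*` as well.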
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G20110.1  2e-48    CPP family protein