PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID OIS98539
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae; Nicotiana
Family CPP
Protein Properties Length: 823aa    MW: 89101.4 Da    PI: 5.9703
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
OIS98539genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR50.83.3e-16522560240
       TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
               ++k+CnCkkskClk+YCeCfaag++C e C C+dC+Nk 
  OIS98539 522 SCKRCNCKKSKCLKLYCECFAAGVYCVEPCACQDCFNKP 560
               689**********************************96 PP

2TCR512.8e-16608646139
       TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
               ++k+gCnCkks ClkkYCeC++ g+ Cs +C+Ce+CkN 
  OIS98539 608 RHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNA 646
               589***********************************7 PP

Sequence ? help Back to Top
Protein Sequence    Length: 823 aa     Download sequence    
MSTEEEKVSL ERENVVEKES EEGRKSVKGG GVKGGVVVVM DTPERKKNQI ATPISKFEES  60
PVFNFLNNLS PIKLDKSAHF TQTLNSLSFA SIPSVFTSPH VSSIRESRFL RRHLLSDPSK  120
PEFASDNGVK VDTDEGMLDA VNNSIEPKES FGCGISGGEA PASPSHECSK LAVELARSLN  180
YECSSPSTLP TGGIRGKSLP EFAGTSVTYV PLVQEISGKG LLRCEVNMDG TNEIVQDKEE  240
ASGCDWESLI TDATDLLIFD SPGDPEAFTK ASGANFRTFE YVSNEMQNVQ HFTQVSTSEC  300
GGDGSETEKP STQPGEESQL KEYADAENTK PDASLTNDGM CVGQSEKNDN EMVSALYRGM  360
RRRCLVFEMV GPRRKHIDEG SGSSAAQEAD GNVASNDKQL VPSKTVNEYP RCTLPGIGLH  420
LNSLTASSKD GKVVKQEANA SGKQLVIASG SSITLHSSVT GQESLVKSLP ETSRGGEIVP  480
FENSLPLMED VFQAPGYVNS EESNQTSPKK KRRRLETGEG ESCKRCNCKK SKCLKLYCEC  540
FAAGVYCVEP CACQDCFNKP IHEDTVLATR KQIESRNPLA FAPKVIRTTD SLSETTGDDS  600
SKTPASARHK RGCNCKKSGC LKKYCECYQG GVGCSINCRC EGCKNAFGRK DGTDGDAEEE  660
ETDAYEKSIV DRTSHKNLVQ SDVEQNPDSG PPATPLNFGR PPMQLPFSLK NKPPRSSFLS  720
IGSSSGGIYA AGQGVGRANF FQPQPKFDKP FESVQVQEDE MPEILQGTKS PVSGIKTASP  780
NRKRVSPPHC NYGNSHSHSP GRRSSRKLIL QSIPSFPSLT PNP
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1508515PKKKRRRL
2509514KKKRRR
3511515KRRRL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.11e-130CPP family protein