PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PCP030616.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family CPP
Protein Properties Length: 792 aa    MW: 86651.5 Da    pI: 5.3526
Description CPP family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
PCP030616.1    genome  GDR     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     49.7   7.3e-16  499    537  2          40
          TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                  ++k+CnCkkskClk+YCeCfaag++C e C+C++C+Nk 
  PCP030616.1 499 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFNKP 537
                  689**********************************96 PP

2    TCR     50.8   3.2e-16  584    623  1          40
          TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                  ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN+ 
  PCP030616.1 584 RHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCKNTF 623
                  589***********************************85 PP

Sequence
Protein Sequence    Length: 792 aa
MDTPERNQIG TPKAKFEDSP VFNYINSLSP IKPVKSVHIT QTFSSLSFAS LPSVFTSPHV  60
SSHKESRFLR RHNPSDPSKP ESSLESGNKV SANEDAAELC INSAELHEDC VPGDSIGEAS  120
VEPSSEHSKF VIELPRNLKY DCGSPECGAT TRGTEAGCES EAADLSAPLV PYVQKTSENG  180
SSDDGAHLQN ICQSEQRNEL TGSDWESLIS DASDLLIFNS PNGSEAFRGI MQNPLDSVTG  240
FCNSLMPRLS QNEINDEQNV LVLDTVVSRE QPETEDPSSQ YGEASQPEDT ELMQDHLNNC  300
MVSSQIEKEN NKVEAPMQFN CKPGLNLHRG LRRRCLDFEM VGVRRKSLDN VPNSSSSMLS  360
QSDEKITTND KQLVPMKPGG ESVRCILPGI GLHLNALATT SKDYKTIKRE NMAYGRQLSL  420
PNLTAPANSP TAGQGPGHDS FSSASSERDM DGTENGVQLL QDASQEPAFL ANEEFNQNSP  480
KKKRHVLPKF EQAGEAESSC KRCNCKKSKC LKLYCECFAA GVYCIEPCSC QECFNKPIHE  540
DTVLATRKQI ESRNPLAFAP KVIRNADPVP EYGEESSKTP ASARHKRGCN CKKSSCLKKY  600
CECYQGGVGC SISCRCEGCK NTFGRKDGSF MGTETEADEE EAEACGKSAA ENYQQKNEIQ  660
ENEEQRQDSA LPTTPLRLSR QLVSLPFSSK NKPPRSSVFS IGSSSGLYTS QKLGILRPET  720
KFERHIQTVP EDEMPEILQG DGSPSTGIKT ASPNGKRVCP PNREFAPSPG PRAGRKLILQ  780
SIPSFPSLTP QN
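
The coordinates in the domain and NLS tables are 1-based and inclusive, and they refer to positions in the protein sequence above. As a minimal sketch (the helper names `parse_sequence_block` and `subsequence` are illustrative and not part of PlantTFDB), the grouped sequence lines can be joined into one string and sliced by those coordinates. Only the two lines covering residues 481-600 are used here, so an offset of 480 is assumed.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join 10-residue groups into one string, dropping position counters."""
    residues = []
    for line in text.splitlines():
        # Uppercase runs are residues; the trailing number is a counter.
        residues.extend(re.findall(r"[A-Z]+", line))
    return "".join(residues)


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


# Two lines copied from the record above (residues 481-600).
block = """\
KKKRHVLPKF EQAGEAESSC KRCNCKKSKC LKLYCECFAA GVYCIEPCSC QECFNKPIHE  540
DTVLATRKQI ESRNPLAFAP KVIRNADPVP EYGEESSKTP ASARHKRGCN CKKSSCLKKY  600
"""

fragment = parse_sequence_block(block)
offset = 480  # number of residues that precede this fragment

# First TCR domain (499-537) and the NLS (481-509) from the tables.
first_tcr = subsequence(fragment, 499 - offset, 537 - offset)
nls = subsequence(fragment, 481 - offset, 509 - offset)

print(first_tcr)
print(nls)
```

With the full 792-residue sequence the offset would be 0 and the table coordinates could be passed to `subsequence` directly.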
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    481    509  KKKRHVLPKFEQAGEAESSCKRCNCKKSK
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  1e-156   CPP family protein