PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PCP041149.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family CPP
Protein Properties Length: 788 aa    MW: 86189.7 Da    pI: 5.3491
Description CPP family protein
Gene Model
Gene Model ID   Type     Source   Coding Sequence
PCP041149.1     genome   GDR      View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     49.7   7.2e-16  491    529  2          40
          TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                  ++k+CnCkkskClk+YCeCfaag++C e C+C++C+Nk 
  PCP041149.1 491 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFNKP 529
                  689**********************************96 PP

2    TCR     50.5   4e-16    576    614  1          39
          TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                  ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  PCP041149.1 576 RHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCKNA 614
                  589***********************************7 PP

Sequence
Protein Sequence    Length: 788 aa
MDTPERNQIG TPKAKFEDSP VFNYISSLSP IKPVKSVHIT QTFSSLSFAS LPSVFTSPHV  60
SSHKESRFLR RHNPSDLSKP ESSLESGNKV SVNEDAAELY INAEELHENC VPGDPVGETS  120
AEPPCEHSKF VIELPRNLKY DCGSPECDAT TSCGTEAGCE LEAAGLSAPL VPYAQKTSEN  180
GSSDGEGEAH LQNICQSGQR REGTGCDWES LISDATDLLI FDSPNASEAF RGIMQNPLDS  240
VTRFCNSLMP RLSQNDINDE QNVLVLDTVG SGEQPETEDP SSQYGEASQQ EDTELMQDHL  300
NSCMVSNQIE KEDQFNCKPG LNLHRGLRRR CLDFEMAEAR RKSLDNVPNS SSLLSQSDEK  360
ITTNDKQLVP MKPGGESARC ILPGIGLHLN ALATTSKDYK TIKRENMAYG RQLSLPNLSA  420
SAHSPTAGQG PGHESFSSAS SERDMDGTEN GVQLLQDASQ EPAFLANEEF NQNSPKKKRR  480
KFEQAGETES SCKRCNCKKS KCLKLYCECF AAGVYCIEPC SCQECFNKPI HEDTVLATRK  540
QIESRNPLAF APKVIRTADP VPEYGEESSK TPASARHKRG CNCKKSSCLK KYCECYQGGV  600
GCSISCRCEG CKNAFGRKDG SFVGTETEAD EEETEACDKS VAENHQQKNE IQKNEEQRQD  660
SALPSTPLRL SRQFSLPFSS KNKPPRSSTF NIGSSSGLYT ITIQKLGKPN ILRPESNLEG  720
HNQAVPEDEM PEILQGDGSP STGIKTASPN GKRVSPPNNE FGPSPGRRAG RKLILQSIPS  780
FPSLTPQH
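The listing above is laid out in blocks of 10 residues, with a running residue count at the end of each full line. A minimal Python sketch for collapsing such a listing into one contiguous string follows, assuming that layout holds for every line. The function name `parse_blocked_sequence` is hypothetical, and only the first two lines of this record are used as sample input.

```python
def parse_blocked_sequence(text: str) -> str:
    """Join 10-residue blocks into one string, dropping the running counts."""
    residues = []
    for line in text.splitlines():
        for token in line.split():
            # Purely numeric tokens are the end-of-line residue counts (60, 120, ...).
            if token.isdigit():
                continue
            residues.append(token)
    return "".join(residues)


# First two lines of the protein sequence listing (residues 1-120).
sample = """\
MDTPERNQIG TPKAKFEDSP VFNYISSLSP IKPVKSVHIT QTFSSLSFAS LPSVFTSPHV  60
SSHKESRFLR RHNPSDLSKP ESSLESGNKV SVNEDAAELY INAEELHENC VPGDPVGETS  120
"""

sequence = parse_blocked_sequence(sample)
print(len(sequence))  # 120
```

Applied to the full listing, the same function should return a string whose length equals the stated 788 aa.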
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    475    481  PKKKRRK
2    476    480  KKKRR
3    476    481  KKKRRK
4    477    481  KKRRK
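The Start and End columns can be checked against the protein sequence. The Python sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the sequences listed in the table. The names `FRAGMENT`, `FRAGMENT_START` and `subsequence` are hypothetical, and the fragment holds only residues 421-540 of the record rather than the full protein.

```python
# Residues 421-540 of PCP041149.1, copied from the sequence listing.
FRAGMENT_START = 421
FRAGMENT = (
    "SAHSPTAGQG PGHESFSSAS SERDMDGTEN GVQLLQDASQ EPAFLANEEF NQNSPKKKRR "
    "KFEQAGETES SCKRCNCKKS KCLKLYCECF AAGVYCIEPC SCQECFNKPI HEDTVLATRK"
).replace(" ", "")


def subsequence(start: int, end: int) -> str:
    """Return residues start..end (assumed 1-based, inclusive) from FRAGMENT."""
    last = FRAGMENT_START + len(FRAGMENT) - 1
    if start < FRAGMENT_START or end > last or start > end:
        raise ValueError("coordinates fall outside the fragment")
    offset = start - FRAGMENT_START
    return FRAGMENT[offset : offset + (end - start + 1)]


# (start, end, expected sequence) rows taken from the NLS table.
nls_rows = [
    (475, 481, "PKKKRRK"),
    (476, 480, "KKKRR"),
    (476, 481, "KKKRRK"),
    (477, 481, "KKRRK"),
]

for start, end, expected in nls_rows:
    print(start, end, subsequence(start, end), subsequence(start, end) == expected)
```

The same helper covers the first TCR domain hit (491-529), since those residues also fall inside the fragment.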
Best hit in Arabidopsis thaliana
Hit ID        E-value  Description
AT4G14770.1   1e-158   CPP family protein