PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PON80004.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Cannabaceae; Parasponia
Family CPP
Protein Properties Length: 801aa    MW: 87394.3 Da    PI: 6.1317
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PON80004.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR49.96.4e-16506543340
         TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                 +k+CnCkkskClk+YCeCfaag++C e C+C++C+Nk 
  PON80004.1 506 CKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFNKP 543
                 89**********************************96 PP

2TCR50.54.1e-16590628139
         TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                 ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  PON80004.1 590 RHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCKNA 628
                 589***********************************7 PP

Sequence ? help Back to Top
Protein Sequence    Length: 801 aa     Download sequence    
MDMDTPKKNQ IGTPVSKFED SPVFNYINSL SPIKPVKSLH ITQTFSSLSF GSLPSVFTSP  60
HVSSHKESRF LRRYSCSDPS KPELSSENGN KATTSEGIVM DTAQLYSNSS ELHENGKSEV  120
NIGEASVEPQ NEGSTFVIEL PRVLKYDCGS PDCVSMPSGV EADFATDSAD PSTSLVPHVP  180
DVPEMGSSDN EAQFQELCLS EERKQGTVYD WESLISNAAD ILIFNSPSGT EAFKGLIQNS  240
LEPVTRFCTS FAAEFSHNEI NNEHQMQIVD PVSSEQHNGE EPLSQTGDGS RLEDVEQTQD  300
RFADTNSNRG MASNQSEIKD NEDKTCVAFA CKSVFSLHRG MRRRCLDFEV AGARRKNLED  360
GSNNSSVSSQ HDEEITANEK QPGFIRPCGE SSSKRMLPGI GLHLNALATS SKDCKITKNE  420
NLSSGIQIRL PGSTVSIHSP TAGQEGLDKS LIPISSEIDN NTPENGVQLL QDASQAPGSL  480
TSEEFNQNSP KKKRRRLDNA GETEGCKRCN CKKSKCLKLY CECFAAGVYC IEPCSCQECF  540
NKPIHEDTVL ATRKQIESRN PLAFAPKVIR SSDSVPEFGD ESSKTPASAR HKRGCNCKKS  600
SCLKKYCECY QGGVGCSISC RCEGCKNAFG RKDGSVLTGT DEQDDEEAEA CEKSLVDKPL  660
QKIEIQNNEE QNPGSSLPIT PLRISRPLLP IPFSSKGKPP RSSFLTGTSS SGLYSQKHGK  720
PSILRSQPKF EKHVQTIPDD EMPEILRGDG SPTTGIKTAS PNSKRISSPH CHFGSSSPGR  780
RSGRKLILQS IPSFPSLTPQ H
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1490497PKKKRRRL
2491496KKKRRR
3493497KRRRL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22780.11e-154CPP family protein