PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PON73877.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Cannabaceae; Parasponia
Family CPP
Protein Properties Length: 797aa    MW: 88609.5 Da    PI: 4.7413
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PON73877.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR49.86.8e-16496533340
         TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                 +k+CnCk+skClk+YCeCfaag +C e C+C+dC Nk 
  PON73877.1 496 CKRCNCKRSKCLKLYCECFAAGLYCVEPCSCQDCLNKP 533
                 79**********************************96 PP

2TCR50.93e-16598637140
         TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                 ++k+gCnC+ks ClkkYCeCf+ g+ Cs +C+C +CkN+ 
  PON73877.1 598 RHKRGCNCRKSGCLKKYCECFQGGVGCSLSCRCMGCKNTF 637
                 589***********************************86 PP

Sequence ? help Back to Top
Protein Sequence    Length: 797 aa     Download sequence    
MNVDEEEEEE VRQSKLNIWD SPVFNYISNL SPIEPVKSGH NNHTFSSLTF ASPPSLFTSP  60
QISSFSETRF AIRRHQFSDP SKPESSQSEY ESKTSEGAPV ATQSEQLECF NSGDSAREVT  120
NESSTENLEL AVEFPSTLRY DCGSPDGNLV PSDAVKMPEL ASTELSLVQF VADDSKEGNF  180
SFEREENLRG ICRVEENNGI AGCSWVKVVS ESNNIFGLDS PITGDHSDEQ EPRMVDSGTI  240
SFISNVLDDN LNDMGKTEFV CPTGSCEQFG MRESGVESEG IGDTRETDQT PAMLSSTLLN  300
KLVVSDSCDL VDDKRQKYIK SNCKPSSQLY RGIRRRSLVF ERTETHERIS ICESKGSSSV  360
SIQSNYKVAS ADNDLVENKN GCSSLSKLPG IGLHLNALTN TSDETIVKIE TRVSESPQTS  420
TPKMIFNTSL TSDEVPQDIC SSHNTLERAL VPWDDEAQVL ENAYQTSECL VGEEFDHGSP  480
RKKRRKSEPV GESLACKRCN CKRSKCLKLY CECFAAGLYC VEPCSCQDCL NKPIHENTVL  540
ETRKQIESRN PLAFAPKVIR SVDGVGEFGN GFSYESWYPF LARRYGEDDT KATPASARHK  600
RGCNCRKSGC LKKYCECFQG GVGCSLSCRC MGCKNTFGQK DGMEETEFDE PKLETYEKLV  660
MDVSLETFKD QDLLITSSEI SRPPNEQQPN LIRKILLPSI PSVESSHQFG NRENPEKITL  720
ESHLQMIPED GTLETLESSC QQTRSGVKST SPNSKRVSPP YREFGSLTAR RSGRKLILRS  780
IPAYPQLSTD HESSDFP
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1482486KKRRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22780.18e-95CPP family protein