PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID POO02113.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Cannabaceae; Trema
Family CPP
Protein Properties Length: 801aa    MW: 87464.4 Da    PI: 6.0416
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
POO02113.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR49.96.4e-16506543340
         TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                 +k+CnCkkskClk+YCeCfaag++C e C+C++C+Nk 
  POO02113.1 506 CKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFNKP 543
                 89**********************************96 PP

2TCR50.54.1e-16590628139
         TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                 ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  POO02113.1 590 RHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCKNA 628
                 589***********************************7 PP

Sequence ? help Back to Top
Protein Sequence    Length: 801 aa     Download sequence    
MDMDTPKKNQ IGTPVSKFED SPVFNYINSL SPIKPVKSLH ITQTFSSLSF GSLPSVFTSP  60
HVSSHKESRF LRRYSCADPS KPELSSENGN KATTSEGIVM DTAQLYSNSS ELHENGKSEA  120
NIGEASVEPQ NEGSTFVIEL PRVLKYDCGS PDCVSMPCGV EADFATDSAD PSTSLVPHVP  180
DVPEMNSSDN EAQFQELCLS EQRKQGTVCD WESLISDAAD ILIFNSPNGT EAFKGLIQNS  240
LEPVTRFCTS FAAEFSHNEI NNEHQMQIVD PVSSEQHNGE EPFSQTGDAS SLEDVEQTQD  300
RFADTNSNRG MASNQSEIKD NEDKTCVAFA CKSVFSLHRG MRRRCLDFEA AGARRKNLED  360
GSNNSSVSSQ HDEDITANEK QPGFIRPCGE SSSRRMLPGI GLHLNALATT SKDCKITKNE  420
NLSSGIQIRL PGSTVSIHSP TAGQEGLDKS LIPISSEIDN NTPENGVQLL QDASQAPGSL  480
TNEEFNQNSP KKKRRRLDNA GETEGCKRCN CKKSKCLKLY CECFAAGVYC IEPCSCQECF  540
NKPIHEDTVL ATRKQIESRN PLAFAPKVIR SSDSVPEFGD ESSKTPASAR HKRGCNCKKS  600
SCLKKYCECY QGGVGCSISC RCEGCKNAFG RKDGSVLTGT EEQDEEETEA CEKSLVDKPL  660
QKIEIQNNEE QNPGSSLPIT PLRISRPLLP IPFSSKGKPP RSSFLTGTSS SGLYSQKHGK  720
PSILRSQPKF EKHVQTIPDD EMPEILRGDG SPTTGIKTAS PNSKRISPPH CHFGSSSPGR  780
RSGRKLILQS IPSFPSLTPQ H
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1490497PKKKRRRL
2491496KKKRRR
3493497KRRRL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22780.11e-157CPP family protein