PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pd.00g400120.m01
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family CPP
Protein Properties Length: 784aa    MW: 86029.5 Da    PI: 6.7512
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pd.00g400120.m01genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR47.53.5e-15511549341
               TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                       +k+CnC++skClk+YCeCfaag++C ++C C +C Nk e
  Pd.00g400120.m01 511 CKRCNCRRSKCLKLYCECFAAGVYCVDSCACVNCYNKPE 549
                       89**********************************975 PP

2TCR52.21.2e-16597635139
               TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                       ++k+gCnCkkskClk+YCeCf+a++ Cs+ C+Ce+CkN+
  Pd.00g400120.m01 597 RHKRGCNCKKSKCLKRYCECFQAKVGCSSACRCEGCKNT 635
                       589***********************************8 PP

Sequence ? help Back to Top
Protein Sequence    Length: 784 aa     Download sequence    
MDRTPEANRI AATKSSSISS SPAVQESPFS NFLSNLSPLN TATTASYTQT LLGTNLPTPP  60
VVFTSPHINL QRETSFLERD DIVEAGSEVY KECNTNIVQI QNPSFEEVQL CSPSGCVDEY  120
LAYPVEVDST CSADLRSQRT NEVPRLLHSG FSPGKESNTK VCDIMFGSPE NEAIVLSDQA  180
EKDLPLSSLE MSQAAINQRD GKKTEELSRF IFEKVRESDV NSCLVSRAQN CGENAAKVNR  240
AGCQYDEKNA SSQSREDSNE RKQRVQKGLV KEGGQNERGI RRHLQFEAAK AYKFTILGNS  300
ESPNSLTHDG TNSRSPSILT NLKSLASSHF DNRASSSLQD VSCDTSQFPS SPYESFTSAQ  360
IGVNSTTSAP IHSGIGLHLN RISRSTSLSS DIFSSKKSTG YLSTPEQMLE YGSNNIATNS  420
SSILTSAVSG KIYVYVASSQ QESQAVTEAN SFTFHSTDTM KPPCHSMLVD QEAAPCEVGM  480
SASQETDEVE ELNQLSPKRK RRRDAYMSEG CKRCNCRRSK CLKLYCECFA AGVYCVDSCA  540
CVNCYNKPEF EDTVLDIRQQ IEARNPLAFT PKVVDNAIDS SPNFTEEQDL TTPSSARHKR  600
GCNCKKSKCL KRYCECFQAK VGCSSACRCE GCKNTFGVTP EPVYNRAKRW EAHPAEKLDN  660
VKGAIACIKA PAFSSASSSN SAKIPQAQLS SSQLQPSGAH YDIPHDDTPE ILEETSNPTK  720
FVKASSPNQK HVSPPQSRSR LRERSPTGLR SGRKFILQAM PSFPPLTPHR NSNEGTNEIE  780
NDDK
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1498503KRKRRR
2499503RKRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.14e-60CPP family protein