PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Pav_sc0000709.1_g130.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family CPP
Protein Properties Length: 896 aa    MW: 98452.2 Da    pI: 6.974
Description CPP family protein
Gene Model
Gene Model ID                Type      Source
Pav_sc0000709.1_g130.1.mk    genome    GDR
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     47.3   4.1e-15  510    548  3          41
                        TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                                +k+CnC++skClk+YCeCfaag++C ++C C +C Nk e
  Pav_sc0000709.1_g130.1.mk 510 CKRCNCRRSKCLKLYCECFAAGVYCVDSCACVNCYNKPE 548
                                89**********************************975 PP

2    TCR     31.8   2.9e-10  596    618  1          23
                        TCR   1 kekkgCnCkkskClkkYCeCfaa 23 
                                ++k+gCnCkkskClk+YCeCf+ 
  Pav_sc0000709.1_g130.1.mk 596 RHKRGCNCKKSKCLKRYCECFQV 618
                                589******************96 PP

Sequence
Protein Sequence    Length: 896 aa
MDRTPETKRI AATTSSSISS SPAVQESPFS NFLSNLSPLN TATTASYTQT LLGTNLPTPP  60
VVFTSPHINL QRETSFLERD DIVEAGSEVY KECNTNIVQI QNPSFEEVQL CSPSGCVDEY  120
LADPVEVDST CSADLRSQRT NEVPRLLHSG FSPGKESNTE VCDIMFGSPE NEAVLLSDQA  180
EKDLPLSSLE MSQAAINQKG GKKTEELSRF IFEKVKESDV NACLVSRAQN CGENAAKLQR  240
AGCQYDEKNA SSQSRHSNEH KKRVQKGLVK EGGQNERGIC RHLQFEAAKA YKFTILGNSE  300
SPCSLTHDAT NSRSPSILTN LKSLASSHFD NRASSSPQDV SCDTSQFPSS PYESFTSAQI  360
GVNSTTSAPI HSGIGLHLNR ISRSTSLSSD ICSSKKSTGY LRMPEQMLEH GSNNIATNSS  420
SILTSAVPGK IYVHVASGQQ ESQAVTEASS FTFHSTDTMK PPCHSMLVDQ EAALCEVGMS  480
TSQEADEVEE LNQLSPKRKR RKDAYMSEGC KRCNCRRSKC LKLYCECFAA GVYCVDSCAC  540
VNCYNKPEFE DTVLDIRQQI EARNPLAFTP KVVDNAIDSS PNFTEEQDLT TPSSARHKRG  600
CNCKKSKCLK RYCECFQVYM YIYSFHFSIS TACDIRKMVF LQEFWKLVSI DMSLTYLDLT  660
SLFFFCSFPK AKVGCSSACR CEGCKNTFGV TPEPIYNRAK RWEAHPSENL DNVKGAIACI  720
KAPGINHYSP TWEGISDISK LTPLSHPCSR TAFSSASSSN SAKIPQAQLP SSQLQPSGTG  780
HFHWGCSAVI VTPELSEGQG PHDLNSDGAG AHYHIPYDDT PEILKETSNP TKVVKASSPN  840
QKRVSPPQSR SRLRERSPTG LRSGRKFILQ AMPSFPPLTP HRNSKEGTNE IENDDK
Nuclear Localization Signal (NLS)
No.  Start  End  Sequence
1    497    502  KRKRRK
2    498    503  RKRRKD
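
The NLS coordinates can be checked against the protein sequence above. The following is a minimal Python sketch, assuming that Start and End are 1-based, inclusive residue positions (the record does not state this explicitly, but the listed motifs are consistent with it). It uses only the sequence row covering residues 481 to 540; the names `ROW_START`, `ROW`, and `slice_1based` are hypothetical helpers introduced for illustration and are not part of PlantTFDB.

```python
# Sequence row covering residues 481-540, copied from the protein
# sequence in this record with the spacing removed.
ROW_START = 481  # 1-based position of the first residue in ROW
ROW = "TSQEADEVEELNQLSPKRKRRKDAYMSEGCKRCNCRRSKCLKLYCECFAAGVYCVDSCAC"


def slice_1based(row: str, row_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a sequence row.

    Assumes the requested range lies entirely within the row.
    """
    if start < row_start or end > row_start + len(row) - 1 or start > end:
        raise ValueError("range falls outside the supplied row")
    return row[start - row_start : end - row_start + 1]


if __name__ == "__main__":
    # The two overlapping NLS entries listed in the table.
    for start, end in [(497, 502), (498, 503)]:
        print(start, end, slice_1based(ROW, ROW_START, start, end))
```

The two NLS entries overlap by five residues, so together they describe a single basic stretch spanning positions 497 to 503.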
Best hit in Arabidopsis thaliana
Hit ID         E-value  Description
AT4G14770.1    4e-51    CPP family protein