PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pav_sc0001539.1_g250.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family CPP
Protein Properties Length: 796aa    MW: 87333.5 Da    PI: 6.1089
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pav_sc0001539.1_g250.1.mkgenomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR49.87e-16499537240
                        TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                                ++k+CnCkkskClk+YCeCfaag++C e C+C++C+Nk 
  Pav_sc0001539.1_g250.1.mk 499 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFNKP 537
                                689**********************************96 PP

2TCR50.54.1e-16584622139
                        TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                                ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  Pav_sc0001539.1_g250.1.mk 584 RHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCKNA 622
                                589***********************************7 PP

Sequence ? help Back to Top
Protein Sequence    Length: 796 aa     Download sequence    
MDTPERNQIG TPKAKFEDSP VFNYINSLSP IKPVKSVHIT QTFSSLSFAS LPSVFTSPHV  60
SAHKESRFLR RHNTSDPSKP ESTLASGNKV SANEDAAQLY INSEELREDC VPGVSTGEDS  120
VEPSSEHSKF VIELPRNLKY DCGSPDCDPT RCGTEAHCEL EVADLSAPLV PYVQKTSEEG  180
SSNDEGHLQI ICQTVQRKEV TGCDWESLIS DAADLLIFDS PNGTEAFKGL MQNSLDPVTR  240
FCTSLAPQLT QNDVNDEQNV QVLDIVGSGG QLETEDPSSQ YGEASQLERT EQMEGHLNHC  300
MVSSQSEKED NKVETPMQFN CKPAVLNLQR GLRRRCLDFE MAGARRKSLD NVSNSSSNML  360
SQSDEKITTN DKQLVPMKPG GESSRCILPG IGLHLNALAK TSKDYKIIKC ESLAYDRQLS  420
LPNSTADIHS PTGGQGPGHE SFSSASSERD MDVAENGVQL AHDASQEPAF LANEEFNQNS  480
PKKKRHVLRR FEHAGETESC KRCNCKKSKC LKLYCECFAA GVYCIEPCSC QECFNKPIHE  540
DTVLATRKQI ESRNPLAFAP KVIRSSDSVP ELGEESSKTP ASARHKRGCN CKKSSCLKKY  600
CECYQGGVGC SISCRCEGCK NAFGRKDGSF IGTEAELDEE EAEACEKSVA EKHQQKIEIQ  660
KNEEQRPDSA LPTTPLRLSR QMVSLPFSSK NKPPRSSVFS IGGSSSGLYT SQKLGKPNIL  720
RPESKFERHS QTVPEDEMPE ILQGDGSPST GVKTASPNSK RVCPPNSDFG PSPGRRTGRK  780
LILQSIPSFP SLTPQH
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1482509KKKRHVLRRFEHAGETESCKRCNCKKSK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22780.11e-137CPP family protein