PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_023915620.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fagales; Fagaceae; Quercus
Family CPP
Protein Properties Length: 955aa    MW: 104763 Da    PI: 6.9775
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_023915620.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR48.32e-15521559341
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                     +k+CnCkk+kClk+YC+Cfaag +C+e C C++C+Nk e
  XP_023915620.1 521 CKRCNCKKTKCLKLYCDCFAAGIYCAEPCACQGCFNKPE 559
                     79**********************************976 PP

2TCR49.58.2e-16605643139
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks ClkkYCeC++a++ Cs+ C+C++CkN 
  XP_023915620.1 605 RHKRGCNCKKSMCLKKYCECYQANVGCSSGCRCDGCKNI 643
                     589***********************************6 PP

Sequence ? help Back to Top
Protein Sequence    Length: 955 aa     Download sequence    
MLMGSPQSAA TKTTTTSDSQ SVQESPCLFS DLSPIKPVKA AHVAQGFPRF NSPPLVFTSS  60
RISSNSETFL LKRPQDLLSS KEEISCNDDK EKKSADDPGE SKKSVTQLCR VFITDAHKDA  120
EIKKYAQAQP CSSSGCVDEY FADPVELDTV HSADSANVSL NQSSILPESS ATGLPASKNT  180
IKFDDINDMG KDAGTEVKEP LILSKQAQED LHGKPTFDVK SLNTEEQQRD GRWPTDNAGG  240
GQQGGCDSTP QSLSKPSQIV QMCEDNGENV ATIPHRPGGN LILRDPEASE NQRGLRRRCL  300
QFEEALLNTI ENSAGSLDPT NEVTTSRSPS TVAELKSLES SHADLIAASS RRQVIQSRTN  360
LYPPRYSGNS SLSISKPSSI GLHLNSIVNA VPVTCGATAR LKLAEDYMGV QGMKSASVMN  420
FHSLEKMKSC PISSNVVEKI AFIAEDVRYK TKASIAASSS LCESPHTTEP SNLLKPIEYD  480
TTPHDKRKFN SEEVDSNEKY SQVSPKKKRK KTSSTIDGDG CKRCNCKKTK CLKLYCDCFA  540
AGIYCAEPCA CQGCFNKPEY EDTVLEIRQQ IESRNPLAFA PKIVQHVTEF NNREDRNQVP  600
PSSVRHKRGC NCKKSMCLKK YCECYQANVG CSSGCRCDGC KNIYGKKEEY VATELGVTKE  660
MVRERAGEEK FEGVFNEKLE MVATKEDFLQ AELYDLHFTP LTPSFQCSSH GMAEPKSQLL  720
SRRYLQSPES ELTILSSCVD NTKAPETLNS SDMLPQSKES LNVGSYDWQL DYNNVEVMDQ  780
FSPKGDAVAN ICHLTPLSDQ PLTAMASSTS SKMREWKNIS PVQLSPGTGY VSSSGSLCWR  840
SSPITPMSRF GGTKSLQGLN PNSGFCDIRE DDTPEILKDC ATPRNSVKIS CPNWKRVSPP  900
KRHSRELGSN SSGDLKSGRK FILQGVPSFP PLTPYIDSKG STNQKASNNE DNNSR
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1507511KKRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22780.17e-58CPP family protein