PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GAY44696.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Rutaceae; Aurantioideae; Citrus
Family CPP
Protein Properties Length: 549aa    MW: 60702.3 Da    PI: 7.9448
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GAY44696.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR30.47.8e-10231277340
         TCR   3 kkgCnCkkskClkk.........YCeCfaagkkCseeCkCedCkNke 40 
                 +k+CnCk+skClk+         YCeCfaag +C e C C dC+Nk 
  GAY44696.1 231 CKRCNCKRSKCLKLvlanmrlysYCECFAAGLYCIEPCLCLDCFNKP 277
                 89***********5555555556**********************96 PP

2TCR32.61.6e-10324345122
         TCR   1 kekkgCnCkkskClkkYCeCfa 22 
                 ++kkgCnCk+s+ClkkYCeCf+
  GAY44696.1 324 RHKKGCNCKRSSCLKKYCECFQ 345
                 589******************8 PP

Sequence ? help Back to Top
Protein Sequence    Length: 549 aa     Download sequence    
MVTGSRSCSC KQSKMTEPLT QPEEIGDLKD TDQTPDILSS TLLDKLIVSD LREKMDNEEE  60
KCGQSSCKHN ILRRCLDFEM VGAHKKKVAC ESNCTSSNLS QADCKDASTE KHFVPSKSKS  120
SSSSMVAGAG LHLNALAAAS KDGKLVKCKT TASGSKVISM RICESSSLSL TSCPNLLNKS  180
EERDPVPCST EVYVVENAMQ PSVSVCSEEF NHSSPKRKRC KPEHNVESMA CKRCNCKRSK  240
CLKLVLANMR LYSYCECFAA GLYCIEPCLC LDCFNKPIHE DKVLETRRQI ELRNPLAFAP  300
KVIRSIDSTV ELGDEANKTP ASARHKKGCN CKRSSCLKKY CECFQVVFYL GIISIFCVYD  360
TKSCLCIKGG VGCSISCRCE GCKNSFGRKD VPNFAVFSPL GAEEGEYDGE ESETFAKSES  420
DANIHDNTVK EDEEEHPHHL LPSSDISRHE CYGSSPLAVE PLKVCTSPKT SKFFDQPKIE  480
EPLRPTVGEK PPKIHSFLET SSLIPVSPNS KRVSPPHHVI RSSTPRYRSR RSTLRSALSF  540
PSVTPPREH
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1215222PKRKRCKP
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.17e-73CPP family protein