PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_028053668.1
Organism Camellia sinensis
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; Ericales; Theaceae; Camellia; Camellia sinensis
Family CPP
Protein Properties Length: 902 aa    MW: 98430.2 Da    pI: 5.1655
Description CPP family protein
Gene Model
Gene Model ID     Type     Source   Coding Sequence
XP_028053668.1    genome   NCBI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     43.3   7.1e-14  533    570  3          40
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                     + +CnCkkskClk+YC+C aag +C+e+C+C++C+N+ 
  XP_028053668.1 533 CIHCNCKKSKCLKLYCDCLAAGIYCDETCTCQECFNRL 570
                     679*********************************85 PP

2    TCR     46.9   5.3e-15  620    658  1          39
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks C kkYCeC++a++ Cs  C+Ce+CkN 
  XP_028053668.1 620 RHKRGCNCKKSMCSKKYCECYQANVGCSTGCRCEGCKNV 658
                     589***********************************6 PP

Sequence
Protein Sequence    Length: 902 aa
MGSPEIDKTT TPTTTTTTTT TTNTNTSPSD SVTIQDSAIF SYISNLSPIK PVKAAPMAAG  60
FSGLGSPPSV FTSPHLNRHQ ETSYLKRPRC RQLCSAELSQ QDVRGKKIAI CSNEIEKSET  120
QGSSVLVPCT EKECENKGSV QGQATSPSGC VDDYLADSVE VDCANSVHSG SLSLIQSGDV  180
PKSLNDCNDL KEMIPKLDEK NDIGQYAEKA LGAFPATSEL AGQNFQEKSS FDNKPVETDT  240
KQGSSEMTPN ICPIVESDLS VDKALEEHYD LPVAQHVVAA HKEKLDCAIQ FLQESLQPIQ  300
GYGDSSMTAI QASDRHVENI ILHDPKGKQH SGMHRRCPWF EEAHQNIMTN SPGFGSPSNI  360
VTNSRLSTSH ADLEVFESSC LEVSAASSGR QLINLTQPMI SHRNSGSKSA VLKPSGRKLI  420
CQQPANTKSG SILRNVREKV SASSEDGSYE TQALAAITSL ASQSSQIEKP SNDPALLKQG  480
EYQTTPCDKG KSISKRADAV EELNRSSPKK KRKKAWTTND DDDDDDDDDD YSCIHCNCKK  540
SKCLKLYCDC LAAGIYCDET CTCQECFNRL DYEDTVQETR QQIESRNPLA FAPKIVQHVT  600
DSPANNNGEN GNNSTPSSAR HKRGCNCKKS MCSKKYCECY QANVGCSTGC RCEGCKNVYG  660
RKEEYDMTKD ALSKGPIHES FENTFDEKLE MVSNQEGLLQ TELCNPQNLM PLTPSFQFSN  720
HWKDVSKSQF STRRSLPSPA SNVTFLPPHG KSQRSPENSD SHGMLLKARK HDEDVVSCYQ  780
GLDYSNAETV DGFSLRCDEL AIGNDLSTLT NPPSTTIASP LSSKLSDWTT ISRSQSCPVS  840
GHLSSIGSDE KLPDTMEDNM TEILKDTSTP LDALKVMSPP CIDSKDSTSQ NVNDPEDCSS  900
KK
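
The sequence block above is formatted in groups of 10 residues with a running position counter at the end of each full line, so it cannot be sliced directly. The following is a minimal Python sketch, not part of the database record, that strips the spacing and counters and then extracts the two TCR domain regions using the Start and End values from the Signature Domain table. The helper names `clean_sequence` and `region` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive.

```python
import re

# Sequence block copied verbatim from the record above.
RAW_BLOCK = """
MGSPEIDKTT TPTTTTTTTT TTNTNTSPSD SVTIQDSAIF SYISNLSPIK PVKAAPMAAG  60
FSGLGSPPSV FTSPHLNRHQ ETSYLKRPRC RQLCSAELSQ QDVRGKKIAI CSNEIEKSET  120
QGSSVLVPCT EKECENKGSV QGQATSPSGC VDDYLADSVE VDCANSVHSG SLSLIQSGDV  180
PKSLNDCNDL KEMIPKLDEK NDIGQYAEKA LGAFPATSEL AGQNFQEKSS FDNKPVETDT  240
KQGSSEMTPN ICPIVESDLS VDKALEEHYD LPVAQHVVAA HKEKLDCAIQ FLQESLQPIQ  300
GYGDSSMTAI QASDRHVENI ILHDPKGKQH SGMHRRCPWF EEAHQNIMTN SPGFGSPSNI  360
VTNSRLSTSH ADLEVFESSC LEVSAASSGR QLINLTQPMI SHRNSGSKSA VLKPSGRKLI  420
CQQPANTKSG SILRNVREKV SASSEDGSYE TQALAAITSL ASQSSQIEKP SNDPALLKQG  480
EYQTTPCDKG KSISKRADAV EELNRSSPKK KRKKAWTTND DDDDDDDDDD YSCIHCNCKK  540
SKCLKLYCDC LAAGIYCDET CTCQECFNRL DYEDTVQETR QQIESRNPLA FAPKIVQHVT  600
DSPANNNGEN GNNSTPSSAR HKRGCNCKKS MCSKKYCECY QANVGCSTGC RCEGCKNVYG  660
RKEEYDMTKD ALSKGPIHES FENTFDEKLE MVSNQEGLLQ TELCNPQNLM PLTPSFQFSN  720
HWKDVSKSQF STRRSLPSPA SNVTFLPPHG KSQRSPENSD SHGMLLKARK HDEDVVSCYQ  780
GLDYSNAETV DGFSLRCDEL AIGNDLSTLT NPPSTTIASP LSSKLSDWTT ISRSQSCPVS  840
GHLSSIGSDE KLPDTMEDNM TEILKDTSTP LDALKVMSPP CIDSKDSTSQ NVNDPEDCSS  900
KK
"""


def clean_sequence(block: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW_BLOCK)
print("length:", len(seq))

# Start/End pairs taken from the Signature Domain table of this record.
for start, end in [(533, 570), (620, 658)]:
    print(start, end, region(seq, start, end))
```

Under these assumptions, the cleaned sequence should have the stated length of 902 aa, and each extracted region can be compared against the `XP_028053668.1` row of the corresponding alignment.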
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    509    542  KKKRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK
2    510    543  KKKRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK
3    510    514  KKRKK
4    511    542  KRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  6e-56    CPP family protein