PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_028053607.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; Ericales; Theaceae; Camellia; Camellia sinensis
Family CPP
Protein Properties Length: 903aa    MW: 98559.3 Da    PI: 5.1199
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_028053607.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR43.37.1e-14534571340
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                     + +CnCkkskClk+YC+C aag +C+e+C+C++C+N+ 
  XP_028053607.1 534 CIHCNCKKSKCLKLYCDCLAAGIYCDETCTCQECFNRL 571
                     679*********************************85 PP

2TCR46.95.3e-15621659139
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks C kkYCeC++a++ Cs  C+Ce+CkN 
  XP_028053607.1 621 RHKRGCNCKKSMCSKKYCECYQANVGCSTGCRCEGCKNV 659
                     589***********************************6 PP

Sequence ? help Back to Top
Protein Sequence    Length: 903 aa     Download sequence    
MGSPEIDKTT TPTTTTTTTT TTNTNTSPSD SVTIQDSAIF SYISNLSPIK PVKAAPMAAG  60
FSGLGSPPSV FTSPHLNRHQ ETSYLKRPRC RQLCSAELSQ QDVRGKKIAI CSNEIEKSET  120
QGSSVLVPCT EKECENKGSV QGQATSPSGC VDDYLADSVE VDCANSVHSG SLSLIQSGDV  180
PKSLNDCNDL KEMIPKLDEK NDIGQYAEKA LGAFPATSEL AGQNFQEKSS FDNKPVETDT  240
KQGSSEMTPN ICPIVESDLS VDKALEEHYD LPVAQHVVAA HKEKLDCAIQ FLQESLQPIQ  300
GYGDSSMTAI QASDRHVENI ILHDPKGKQH SGMHRRCPWF EEAHQNIMTN SPGFGSPSNI  360
VTNSRLSTSH ADLEVFESSC LEVSAASSGR QLINLTQPMI SHRNSGSKSA VLKPSEGRKL  420
ICQQPANTKS GSILRNVREK VSASSEDGSY ETQALAAITS LASQSSQIEK PSNDPALLKQ  480
GEYQTTPCDK GKSISKRADA VEELNRSSPK KKRKKAWTTN DDDDDDDDDD DYSCIHCNCK  540
KSKCLKLYCD CLAAGIYCDE TCTCQECFNR LDYEDTVQET RQQIESRNPL AFAPKIVQHV  600
TDSPANNNGE NGNNSTPSSA RHKRGCNCKK SMCSKKYCEC YQANVGCSTG CRCEGCKNVY  660
GRKEEYDMTK DALSKGPIHE SFENTFDEKL EMVSNQEGLL QTELCNPQNL MPLTPSFQFS  720
NHWKDVSKSQ FSTRRSLPSP ASNVTFLPPH GKSQRSPENS DSHGMLLKAR KHDEDVVSCY  780
QGLDYSNAET VDGFSLRCDE LAIGNDLSTL TNPPSTTIAS PLSSKLSDWT TISRSQSCPV  840
SGHLSSIGSD EKLPDTMEDN MTEILKDTST PLDALKVMSP PCIDSKDSTS QNVNDPEDCS  900
SKK
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1510543KKKRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK
2511544KKKRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK
3511515KKRKK
4512543KRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.16e-56CPP family protein