PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_028053362.1
Organism Camellia sinensis
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; Ericales; Theaceae; Camellia; Camellia sinensis
Family CPP
Protein Properties Length: 936 aa    MW: 101903 Da    pI: 5.2243
Description CPP family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
XP_028053362.1  genome  NCBI    View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     43.3   7.4e-14  567    604  3          40
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                     + +CnCkkskClk+YC+C aag +C+e+C+C++C+N+ 
  XP_028053362.1 567 CIHCNCKKSKCLKLYCDCLAAGIYCDETCTCQECFNRL 604
                     679*********************************85 PP

2    TCR     46.9   5.5e-15  654    692  1          39
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks C kkYCeC++a++ Cs  C+Ce+CkN 
  XP_028053362.1 654 RHKRGCNCKKSMCSKKYCECYQANVGCSTGCRCEGCKNV 692
                     589***********************************6 PP

Sequence
Protein Sequence    Length: 936 aa
MGSPEIDKTT TPTTTTTTTT TTNTNTSPSD SVTIQDSAIF SYISNLSPIK PVKAAPMAAG  60
FSGLGSPPSV FTSPHLNRHQ ETSYLKRPRC RQLCSAELSQ QDVRGKKIAI CSNEIEKSET  120
QGSSVLVPCT EKECENKGSV QGQATSPSGC VDDYLADSVE VDCANSVHSG SLSLIQSGDV  180
PKSLNDCNDL KEMIPKLDEK NDIGQYAEKA LGAFPATSEL AGQNFQEKSS FDNKPVETDT  240
KQGSSEMTPN ICPIVESDLS VDKALEEHYD LPVAQHVVAA HKEKLDCAIQ FLQESLQPIQ  300
GYGDSSMTAI QASDRHVENI ILHDPKGKQH SGMHRRCPWF EEAHQNIMTN SPGFGSPSNI  360
VTNSRLSTSH ADLEVFESSC LEVSAASSGR QLINLTQPMI SHRNSGSKSA VLKPSGIGLH  420
LNSIINAMPI ACGAIGSMKS AEKGYMNVEG RKLICQQPAN TKSGSILRNV REKVSASSED  480
GSYETQALAA ITSLASQSSQ IEKPSNDPAL LKQGEYQTTP CDKGKSISKR ADAVEELNRS  540
SPKKKRKKAW TTNDDDDDDD DDDDYSCIHC NCKKSKCLKL YCDCLAAGIY CDETCTCQEC  600
FNRLDYEDTV QETRQQIESR NPLAFAPKIV QHVTDSPANN NGENGNNSTP SSARHKRGCN  660
CKKSMCSKKY CECYQANVGC STGCRCEGCK NVYGRKEEYD MTKDALSKGP IHESFENTFD  720
EKLEMVSNQE GLLQTELCNP QNLMPLTPSF QFSNHWKDVS KSQFSTRRSL PSPASNVTFL  780
PPHGKSQRSP ENSDSHGMLL KARKHDEDVV SCYQGLDYSN AETVDGFSLR CDELAIGNDL  840
STLTNPPSTT IASPLSSKLS DWTTISRSQS CPVSGHLSSI GSDEKLPDTM EDNMTEILKD  900
TSTPLDALKV MSPPCIDSKD STSQNVNDPE DCSSKK
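
The sequence block above is printed in groups of ten residues with a running position counter at the end of each line, so it cannot be pasted directly into most tools. The following is a minimal Python sketch, not part of the PlantTFDB record, that strips the formatting, checks the stated length, and slices out the two TCR domain regions listed under Signature Domain. The helper names `clean_sequence` and `region` are hypothetical, and the sketch assumes the Start and End coordinates are 1-based and inclusive.

```python
import re

# Sequence block copied from this entry, including the spacing and the
# running position counters at the end of each line.
RAW_BLOCK = """
MGSPEIDKTT TPTTTTTTTT TTNTNTSPSD SVTIQDSAIF SYISNLSPIK PVKAAPMAAG  60
FSGLGSPPSV FTSPHLNRHQ ETSYLKRPRC RQLCSAELSQ QDVRGKKIAI CSNEIEKSET  120
QGSSVLVPCT EKECENKGSV QGQATSPSGC VDDYLADSVE VDCANSVHSG SLSLIQSGDV  180
PKSLNDCNDL KEMIPKLDEK NDIGQYAEKA LGAFPATSEL AGQNFQEKSS FDNKPVETDT  240
KQGSSEMTPN ICPIVESDLS VDKALEEHYD LPVAQHVVAA HKEKLDCAIQ FLQESLQPIQ  300
GYGDSSMTAI QASDRHVENI ILHDPKGKQH SGMHRRCPWF EEAHQNIMTN SPGFGSPSNI  360
VTNSRLSTSH ADLEVFESSC LEVSAASSGR QLINLTQPMI SHRNSGSKSA VLKPSGIGLH  420
LNSIINAMPI ACGAIGSMKS AEKGYMNVEG RKLICQQPAN TKSGSILRNV REKVSASSED  480
GSYETQALAA ITSLASQSSQ IEKPSNDPAL LKQGEYQTTP CDKGKSISKR ADAVEELNRS  540
SPKKKRKKAW TTNDDDDDDD DDDDYSCIHC NCKKSKCLKL YCDCLAAGIY CDETCTCQEC  600
FNRLDYEDTV QETRQQIESR NPLAFAPKIV QHVTDSPANN NGENGNNSTP SSARHKRGCN  660
CKKSMCSKKY CECYQANVGC STGCRCEGCK NVYGRKEEYD MTKDALSKGP IHESFENTFD  720
EKLEMVSNQE GLLQTELCNP QNLMPLTPSF QFSNHWKDVS KSQFSTRRSL PSPASNVTFL  780
PPHGKSQRSP ENSDSHGMLL KARKHDEDVV SCYQGLDYSN AETVDGFSLR CDELAIGNDL  840
STLTNPPSTT IASPLSSKLS DWTTISRSQS CPVSGHLSSI GSDEKLPDTM EDNMTEILKD  900
TSTPLDALKV MSPPCIDSKD STSQNVNDPE DCSSKK
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace and position counters, keeping residue letters only."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if start < 1 or end > len(seq) or start > end:
        raise ValueError(f"coordinates {start}-{end} fall outside the sequence")
    return seq[start - 1:end]


sequence = clean_sequence(RAW_BLOCK)
print(len(sequence))               # compare with the stated length of 936 aa
print(region(sequence, 567, 604))  # TCR domain 1
print(region(sequence, 654, 692))  # TCR domain 2
```

If the coordinate assumption holds, each printed slice should match the target line of the corresponding domain alignment.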
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    543    576  KKKRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK
2    544    577  KKKRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK
3    544    548  KKRKK
4    545    576  KRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK
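
Because the NLS rows overlap heavily, it can help to check each listed sequence against the protein coordinates. The sketch below is an illustration only, not part of the PlantTFDB record. It uses residues 541-600 copied from the sequence block of this entry and assumes 1-based inclusive Start and End values; the names `segment_slice` and `mismatched_rows` are hypothetical. Rows whose listed sequence differs from the slice are reported, not corrected.

```python
# Residues 541-600, copied from the sequence block of this entry.
SEGMENT_START = 541
SEGMENT = (
    "SPKKKRKKAW TTNDDDDDDD DDDDYSCIHC NCKKSKCLKL YCDCLAAGIY CDETCTCQEC"
).replace(" ", "")

# (No., Start, End, Sequence) rows as listed in the NLS table.
NLS_ROWS = [
    (1, 543, 576, "KKKRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK"),
    (2, 544, 577, "KKKRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK"),
    (3, 544, 548, "KKRKK"),
    (4, 545, 576, "KRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK"),
]


def segment_slice(start: int, end: int) -> str:
    """Slice the segment using protein coordinates (assumed 1-based, inclusive)."""
    lo = start - SEGMENT_START
    hi = end - SEGMENT_START + 1
    if lo < 0 or hi > len(SEGMENT) or lo >= hi:
        raise ValueError(f"coordinates {start}-{end} fall outside residues 541-600")
    return SEGMENT[lo:hi]


def mismatched_rows(rows):
    """Return the row numbers whose listed sequence differs from the slice."""
    return [no for no, start, end, listed in rows
            if segment_slice(start, end) != listed]


for no, start, end, listed in NLS_ROWS:
    status = "ok" if segment_slice(start, end) == listed else "check"
    print(no, start, end, status)
```

A row marked `check` only means that the listed coordinates and the listed sequence disagree under the assumed convention; it does not say which of the two is correct.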
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  1e-55    CPP family protein