PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_022751649.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family CPP
Protein Properties Length: 940aa    MW: 101774 Da    PI: 6.0249
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_022751649.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR47.53.6e-15529567341
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                     +k+CnCkk+kClk+YC+Cfaag +C+  C+C++C+N+ e
  XP_022751649.1 529 CKRCNCKKTKCLKLYCDCFAAGIYCDGPCSCQGCFNRPE 567
                     89**********************************876 PP

2TCR48.12.4e-15615653139
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCk+skClkkYCeC++a++ Cs  C+Ce+CkN 
  XP_022751649.1 615 RHKRGCNCKRSKCLKKYCECYQANVGCSIGCRCEGCKNV 653
                     589***********************************6 PP

Sequence ? help Back to Top
Protein Sequence    Length: 940 aa     Download sequence    
MDSPEPSKSP ISSSAAASTS SSSPVQESPF SNYISSLSPI KHDKAPLVTQ AFVGLNSPPL  60
VFTSPRINTR RRPQSSSVEV SENGEGDKSK TDGPGSLERS VTVLQQGLIT DIKEDDTKNS  120
VSVQPSSSSG CVDEYLADPV EADCANSACT ISLKLKQSNN VLQSSVNGLP DSKNLNFDDK  180
NDVGREVDST QLLSGLSEKG LERKLTFDVK PLKIDNEEHA GQRISDECRK FEADMFDLSS  240
QEKECKNLDS QKVVEDHGNG CDGFLHLPPE SLQRVQAYEG FAENVEGDLD VPIHNMTHDL  300
EASEHQRGMS RRCLQFGETQ PETTANCNSS SNLANDMVTS TSLATTSEIE GLGSSHVDLS  360
ATSRKRQLVN LSQLAINMIP QCHVENSSLT VYKPSGVGLH LNSIVNAIPM GQGGTVSMNL  420
AVDSLGIQEI KSASIASCQS MENMQSQSDA FEKVSAAPQD GILEAKVSMI AGSAACESLH  480
TVESIECHTT LSTKKKLSSE DGDSNEVFNH QSPKKKRQKL SNSTDGEGCK RCNCKKTKCL  540
KLYCDCFAAG IYCDGPCSCQ GCFNRPEYED TVLETRQQIE SRNPLAFAPK IVQPFTDFPV  600
NSREDGNWKT PSSARHKRGC NCKRSKCLKK YCECYQANVG CSIGCRCEGC KNVYGKKEDY  660
CLTEEMVSRG GGEISESTVA AKKDFLHSEL CDPHYLTPPT PSFQCSDHGK NAPISRLPSS  720
RCLPSPESDP TILSYAKSPR TSDSNDMLLE TSKEKLDVGS YCEGINYNNA DVLADECHHT  780
PLLNHSSIFT GSSSSSKARE LTSLSQFRLG PRSGCISSGG SLHWHRSSFM PMSTLDGTKK  840
LQGLNSDGGL DDILEDDTPE LLKGTSTPIK SVKASSPNGK RVSPPHNLHQ LRSSSSGPLR  900
SGRKFILKAV PSFPPLTPCI DSKGSSNQSR SNFQENSSND
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1513520PKKKRQKL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.11e-63CPP family protein