PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID MDP0000248670
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Maloideae; Maleae; Malus
Family CPP
Protein Properties Length: 1003aa    MW: 109637 Da    PI: 6.6892
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
MDP0000248670genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR49.58.4e-16488524339
            TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                    +k+CnCkkskClk+YC+Cfaag++C+e+C C++C+N 
  MDP0000248670 488 CKRCNCKKSKCLKLYCDCFAAGVYCAETCACQGCFNI 524
                    79**********************************6 PP

2TCR50.25.2e-16565603139
            TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                    ++k+gCnCkks ClkkYCeC++a++ Cs+ C+C++CkN 
  MDP0000248670 565 RHKRGCNCKKSMCLKKYCECYQANVGCSSGCRCDGCKNV 603
                    589***********************************6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011141.2E-15486527IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163435.39487605IPR005172CRC domain
PfamPF036381.6E-11489523IPR005172CRC domain
SMARTSM011146.8E-17565606IPR033467Tesmin/TSO1-like CXC domain
PfamPF036382.6E-11567602IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 1003 aa     Download sequence    Send to blast
MDSPEIPKIK DNTSPSSDSP PVQESPVFSY ISNLSPIQPV KASHAVQGFP GLSSPPLVFT  60
SPRINTLRET SFLKSLFVVS VFAHFDSFII FTRPQYPRLS SVEKTKSHDE ARKFVDGTVD  120
SRKHITQLQM GLITDTQDCE TNSSSEGVDE YLADPMEMDS TNSCQSVNPC LKESINASES  180
EAPPEKAKED LDGKQFDAKP VKNKEHSDGE LPSSECPKVG SGLSIDNAFN GEYHQGFHNQ  240
GIGGRHQDDC DHNPPSPPGR LQIGQVYXDC TEKVGGTSKG MIGNMILHAP NKAKNDQGGM  300
HRRCLQFEEA PPCATGKGDS SSVEKVNNSE PPASTAELEI VQVPYAATSK RQMGASLPPR  360
YGGSSPLTVP KPSGIGLHLN SIVNAAPVVR GATTIKLADH YIGVQVMKSS SVVNHHLPEN  420
VRDETENSIA TSSSITQSPH TVEFERHGSP PEKSKLDSQN XDSYLELNQS SSQKKRKKTP  480
SSKDSDGCKR CNCKKSKCLK LYCDCFAAGV YCAETCACQG CFNIPDYEDT VLETRQQIEA  540
RNPLAFAPKI VQLEEEIQFT PASARHKRGC NCKKSMCLKK YCECYQANVG CSSGCRCDGC  600
KNVYGRKGDY VPIEHGVLKD TISDKAGKET FDEKLEMVAT KKDILSTDLY DSHNHTPLTP  660
SFHCSDHANN IPKSPCLPAS YLRSPESDLT IISSYEKSTR SPLRNSETGG ILLETSKELS  720
DMGYYHWRED YDNIGVADTF STRYDAAPTT CHXTPLSDLY SKASASSTSS RSDWKNASQA  780
QLCPGIQGLS SSSSLHWRSS PVTPMARLGG TKSSQGLDFD NGLYDILQEE TPEILKDSSN  840
PIKSVKVSSP NKKRVSPPHN HNHELGASSS GSLRSGRKFI LKAVPSFPPL TPCSNSEGQK  900
EMNGRDTDGV HSEYLVIPPL ATHIIVLKLS SKCLPLHVSC FSFHSGGVSG IYFLQLEAED  960
ICRHCYVGSV VRLECGNIFR YLTKNFPPFC DEQSFDLIGD TTP
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A1e-1949160314121Protein lin-54 homolog
5fd3_B1e-1949160314121Protein lin-54 homolog
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1473477KKRKK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAJ0101652e-37AJ010165.1 Glycine max mRNA for cysteine-rich polycomb-like protein (gpp1 gene).
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_008394057.20.0CRC domain-containing protein TSO1-like
RefseqXP_017178490.20.0CRC domain-containing protein TSO1-like
RefseqXP_017178491.20.0CRC domain-containing protein TSO1-like
TrEMBLA0A498HNJ90.0A0A498HNJ9_MALDO; Uncharacterized protein
STRINGXP_008394057.10.0(Malus domestica)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF85652841
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22780.12e-63Tesmin/TSO1-like CXC domain-containing protein