PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID kfl00644_0020
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Klebsormidiophyceae; Klebsormidiales; Klebsormidiaceae; Klebsormidium
Family CPP
Protein Properties Length: 1842aa    MW: 190663 Da    PI: 8.3024
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
kfl00644_0020genomeKFGPView Nucleic Acid
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR43.47e-1412251262240
            TCR    2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40  
                     ++k+C+Ckks+Clk+YCeCfaa+++C+  C C dC+N+ 
  kfl00644_0020 1225 TCKRCHCKKSRCLKLYCECFAANVYCTG-CDCLDCQNRP 1262
                     589*************************.********86 PP

2TCR50.44.3e-1613051343139
            TCR    1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39  
                     k+kkgC+Ckks C+kkYCeCf+ag+ C++ C+Ce+C N+
  kfl00644_0020 1305 KHKKGCHCKKSMCQKKYCECFQAGVACTDACRCENCANE 1343
                     689***********************************7 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011147.9E-1612241264IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163431.6112251345IPR005172CRC domain
PfamPF036383.2E-1012271261IPR005172CRC domain
SMARTSM011146.7E-1613051346IPR033467Tesmin/TSO1-like CXC domain
PfamPF036382.5E-1113071343IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 1842 aa     Download sequence    Send to blast
MDSPERRAPI KPNWDGIMSS PGSAAFMDLC DTLSPIRPVP TFHASTFNEL NQIRSPASPN  60
FPTRRLKEKR FANSSAAALL LAATTDEAPK TPGLLKREPL RQTFGRNLSR IGDPSTALFA  120
GQSEEAPASA AASKQVLVVQ EGGSVGDCES KLALKGATED PVVAGARSLP AQQGQPNTRC  180
PPPKTEHMEE ATKPAEAKGE DLGAKPESSR TADDGQKASG SKRGRDEPQP DADNLAPQAA  240
APPAFLKSLK ADSDLPATVV VQPPESKRGK ASASDQDAKT VLAVPTPNPY NPCSTVIPGS  300
QQAKESVMPP IASPVPPTKL TEAAAGPLDQ EAEEKKAEGG ATRSGAEQPS LGTQLHRELS  360
FAVKQSGQVA SASERALQAQ EVASESKSQA ILPAPAAMEL TPPTPKLAAA VQDNPEVRPR  420
QTRSRTMEKG EGSGSKQRAG GTGIVLGVLP KGKRSLSNRK KSRFVSGGED SSAAEDAGGS  480
SDDTERTVSA DGAVADEGAQ RLQTSVCGAV AGAGIFGEDE AGSSPSKVDG GAGIESQQGA  540
AETALADTES VKEAGKALGS REDLDGPAKE TGLVRQSAVA VLEPCQKAAE PESARHASPP  600
RSTVLPQLKG VRSPRKGVRR GQETPPKAPA NAAPPPGGAK QTEEDVAQLL VDLSEGRTPE  660
KAKRRLIGEE ATPTSGLGQK RIAVESAPAA AEKLEGGVAG SGRTTPIQVA TAVKVEAISA  720
SVHVVSGPAV SCEPAATPRS VVFQDAKNGT SSSGVFPANQ IYFERPGALG GNTGRARRLD  780
FDADAARRQS LTARQDPRLS LEAQGSWPGF SAVQNSPAQS QMYISVANAS STPGPRNGTP  840
QSGQNPPETV KMVVKQEGGD NAWQPGQKRG ATPEGEGERA AKRERVSFSD GVYQDAKDAS  900
DATAAALSAA MGALSACNTP ATCQPTVGRP NPTPVQARPA QAPPVVARPS GGPVWKAGGN  960
ERSNLAVTPV AMHNAQPSSW PVATPPGAPS AFQHGNPSVT PGQAKQGYVT PPPRCAQPVV  1020
RPQGLASPLV TSAPGVTSAA APFNPNVAIV PKPLPLPAPV RAAKPLNLAR LGAGLSAGTV  1080
VRVTPGVGPV PPEHHTHLPG PQSAMPGPTG PPGKRASFPP TQRPQAFTST RFPGYSHYTP  1140
SYQKALPRPA PQGYGAVSMP FGQAQGPPPG MSPAVVQRQK MEAAIERKPA QPDPLSELQL  1200
DLDLEMDLVV VHESPPQQPK KPAGTCKRCH CKKSRCLKLY CECFAANVYC TGCDCLDCQN  1260
RPEFEDIVMA KKEEIERKDP HAFAPKIVEG ESPSPTKGET PASGKHKKGC HCKKSMCQKK  1320
YCECFQAGVA CTDACRCENC ANEFGTKDSH SHGPHCDHAG GGAVEAAGMR SPATPGFGHI  1380
SIQRTFPGGG KMVAKKQSLD PHGDEISALD FPHHHPTKEK RPETPPKPEP ATSAAPGTSV  1440
APATSAGAAT SAKGRGRKKD AAKGDLKTVR ENRGGAAEQT SSGVFPSISS SPARWSLASP  1500
DPSPVSRSYP IPPSPPDRRG SDTTEFLHLP ASPSPSFGSG PLLRTGTGVG GTPLSRAKVR  1560
STPLGAVIWN PADMASDPAD RRNSGQGVYS PQRTPGVSSS GGMLKLSNLS GTPTPNAVPN  1620
SALEKGGASG LQEGGGFDTG YTNPSALTCD ESVDMDGQDP LAESFFMEPC DYGVEGDLYT  1680
YDPGQMDLPA YGEDVYGEAP KESVPVSPPP KTQRTGPSFV SLKNLSVGSG RQYNEDPDDP  1740
SAALLSPQDG KRRRLSAGGR VSGEAFVSAD EATPIRISKT GSPTKVMAPA FDWGGSPGLG  1800
RMATPGKKSL NLSMLPSLPI LPARLSGGEG SGLVAPENVS QE
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A1e-19121813429120Protein lin-54 homolog
5fd3_B1e-19121813429120Protein lin-54 homolog
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
117501754KRRRL
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00624PBMTransfer from PK22848.1Download
Motif logo
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
TrEMBLA0A1Y1IHW60.0A0A1Y1IHW6_KLENI; Tesmin/TSO1-like CXC domain containing protein
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
Representative plantOGRP9931755
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.14e-37TESMIN/TSO1-like CXC 2