PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID kfl00015_0330
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Klebsormidiophyceae; Klebsormidiales; Klebsormidiaceae; Klebsormidium
Family CPP
Protein Properties Length: 1368aa    MW: 144431 Da    PI: 8.6656
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
kfl00015_0330genomeKFGPView Nucleic Acid
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR421.8e-1312431281342
            TCR    3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkeek 42  
                     + +C+CkkskClk+YC+Cfaag++Cs  C C +C N+ee+
  kfl00015_0330 1243 CLRCKCKKSKCLKLYCDCFAAGVYCSG-CLCVNCLNTEEN 1281
                     679************************.*********986 PP

2TCR46.48e-1513191353539
            TCR    5 gCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39  
                     gC+C+ks+C+k+YC Cf+ag+ C++eC+C +C N+
  kfl00015_0330 1319 GCHCRKSQCQKEYCVCFQAGVPCTKECTCLECLND 1353
                     8*********************************7 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011141.5E-1412411281IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163424.31412421355IPR005172CRC domain
PfamPF036387.7E-1012451279IPR005172CRC domain
SMARTSM011142.1E-813141356IPR033467Tesmin/TSO1-like CXC domain
PfamPF036387.5E-1113191353IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 1368 aa     Download sequence    Send to blast
MDGPSIQSEP QKVLPPKSRL PSLTRSPSSS RAYPPPPVCA PVTELSPQSK DSLLQTQASL  60
GAWLEGRVHT LQECVHTLQR ELCTSDEERR RLLALNAELL RDAQVRDQLC KVQGRELWRL  120
RQILTGVLPN LSLDPPPGTV KETRLHWGET GREGQGSVRG MLESAERRMQ KAGGNVQQKG  180
PESGLIRLQE QGLVRYDEPF KLSPVPLERT SPAIVWKSSE SRAPERRHWV GMAGRGCEAG  240
AIKPPEKLSE CSEQGKLEQG EMGASAERGV LRRSVVVEGA VGKVCSESGC SKDCQAEVSL  300
PQQTPSVIGA SSKGADLEGG EPSLCPASEK HPPGTESSPV AGAKQSNPAE STEDPVASVA  360
EPQGNAGSVP PGSLLGSVDA DSMLAGRLAP SSGRRSSGIA PDMVFAMMQL AEGGQAILSP  420
LGKEGLLQAT RGRNEESGGD RRGNKGAGQT TRGGEKETAD AEKVPRAWEG REKEGQGGEL  480
RTRITEETGE QGKALDSGRK NTEDREEKIG GEEKRAAAME QGSSDWAHEV CDVKNGRSGE  540
HEELGAGRQR TRGAPEDSAG GAAEGALSAE PEKAPGQEKG DDAEREAKLL DHDAEQVSNP  600
VLETWKDEAE GGSQAICKES AVVAQVTESF CAIFAPSQEP VVTSVSGLVP NADAQESPSD  660
SLTRSFKASK TALGEDTTAQ PSHPGSALEA DLEPLSEKGH ERFTRESDAR QPSDLPIVRR  720
KRTKRPLSGV RKKARTAASA FLHVSPKSSN SGSQRLGDRG RLGDPSAPLA KPALHTLPSN  780
LDRTASKLSM DALAKAAALV ERGGLASPEA GGGTAVETAK RAREEAETGG EGSSGSGDEL  840
PISKMGRTGK GGEVTRGKQR ERHVASSVSV PKNAPAGVCG QGAGPKLRVP EVPRRLSTGL  900
SGFTKLPTGP SLIRSAGLSE RPGASNRAKL TAGKTLPNQP GVGNPLTPNP AVKRWSEPGD  960
SHWSCGKTQL LGAKPGVSKP GELTRRTDWA NVGVPSMNVV GRAGPGASHN LPHRLVQRPG  1020
GNGHVPEIAD SAKASAEKMV ISVEEAKRNG ALPQKQPREV PPEKQQPALG TAGSQSGGAP  1080
KPSQPAGFHP PETGTPKIRS AVALHRPPEA RYRGPDVSST SGAFRAFASG QPNRMRPTAF  1140
PDVQPPASAP GGTKSASGLM RKGLEDFSRL VAQRPGGSFS AQDGGGPDAA PNPQMVQFWR  1200
RQEVLAKNMR LLAENDRGSV GSVAKSDGHS CPAGKKEKDA AKCLRCKCKK SKCLKLYCDC  1260
FAAGVYCSGC LCVNCLNTEE NAAIVNERKA AIIRKEPNAF RIVEDRSVGL AVQRHVRGGC  1320
HCRKSQCQKE YCVCFQAGVP CTKECTCLEC LNDVPRLVVR SEAQPQRA
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A2e-131246135214120Protein lin-54 homolog
5fd3_B2e-131246135214120Protein lin-54 homolog
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1719732RKRTKRPLSGVRKK
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
TrEMBLA0A1Y1HN220.0A0A1Y1HN22_KLENI; Uncharacterized protein
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G20110.13e-19Tesmin/TSO1-like CXC domain-containing protein