PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sphfalx0037s0095.2.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Bryophyta; Sphagnophytina; Sphagnopsida; Sphagnales; Sphagnaceae; Sphagnum
Family CPP
Protein Properties Length: 1143aa    MW: 122651 Da    PI: 7.028
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sphfalx0037s0095.2.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR48.31.9e-15666705241
                   TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                           ++k+CnCkkskClk+YCeCfaag +C ++C C++C+Nk e
  Sphfalx0037s0095.2.p 666 SCKRCNCKKSKCLKLYCECFAAGIYCVSSCACQECFNKPE 705
                           689**********************************975 PP

2TCR512.9e-16751789139
                   TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                           ++k+gCnCkks ClkkYCeC++ag+ Cse C+Ce+CkN 
  Sphfalx0037s0095.2.p 751 RHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCKNM 789
                           589***********************************7 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011142.5E-15665706IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163436.157666791IPR005172CRC domain
PfamPF036381.7E-11668703IPR005172CRC domain
SMARTSM011141.6E-18751792IPR033467Tesmin/TSO1-like CXC domain
PfamPF036388.6E-12753789IPR005172CRC domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0009934Biological Processregulation of meristem structural organization
GO:0048444Biological Processfloral organ morphogenesis
GO:0051302Biological Processregulation of cell division
GO:0005634Cellular Componentnucleus
GO:0044212Molecular Functiontranscription regulatory region DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1143 aa     Download sequence    Send to blast
MERFATAETQ ADGHGSLGGP WLEKANCTNN PRSIMTTVNS GYLPSPCNKV LKHGDQAGPS  60
STRRAEALEE VKSKSCKVES VGDDLHLSEN APHGTQFSAM KTSEMDQKLP TCQLSGTSTS  120
SDSFKSQFAN TQEVVKASPL GGPYNSEKVS ERYEDHQARL RLYKEQAEAS DRGKSSQGFA  180
PEIVGSVPVV ISDHVSESQD RGLKRALVSP SHNEDQTQLG QDTTAMAYFL GGDQDSDQRN  240
REDDWSEDSS EGSRLSVHTT SECLKSQKGD MCEGITMNRN ILPAGGSSAL EEDHNLISEP  300
TTANAEQVVR LPHKVNSSGQ HRGFRRRCLD FDASVARGKI LDTGKKPLEI RNSGADVSPS  360
CASISPAVLC DRADETSTAT TSGFSMSINP LDGKEPLKAE SNYSPPAIGA ACLRGDGSSG  420
NKSGLLQMSG CCISGSLEPS SVSGGLGAPV CKTQDFRNCA GGNQNVNESN NQGCHPPVMP  480
SGIGLHLNSL TSSLSFKRDC HSSRIGNSEG TWASMLAAEP SKSASPKDDQ SSDVHTPLDM  540
GSDGLLLTGC QRFGPGVTSL LKNSQMGKFF PDSSLGSKSL KVDTLDFHSL DHESFAPLRG  600
SLSVVSGDTA ARGVSISEQE GMQWQQPDLL HQHEAESPEE FLESPQSLKK RKRKLHVASG  660
EKSGESCKRC NCKKSKCLKL YCECFAAGIY CVSSCACQEC FNKPEFEETV LNTRQQIESR  720
NPLAFAPKIV QSAKASPKIG EDTMDTPASA RHKRGCNCKK SLCLKKYCEC YQAGVGCSEG  780
CRCEGCKNMY GRKEGSREEG KEGDQVFTSQ EEPQGDDPIE LLNRMSGKSE QFRSTGNKNI  840
SPITPSFEHD GLGRSISRLR SGSRKRASDE HCSSTLLQQA GSRPSKSPTW FSNTIDGFQL  900
TANSQGAMEL SVDGGTESPL TTMNISRIEH LSPQWEGLAD ICTLTPLPLA PSRPTPASVT  960
TLDRAGESPR FSAQLIDSTY QGGSSATGHH DGLGRRLRRS PPRFRQPAAR SPLHLTQQNT  1020
CNENHLIHSR PSSASTHLPV EHCQGGKNSM LAISASGEDD DTPDFLKCPE VTSPLQTTIT  1080
KSGSPKQKRV TPPRHCDSRE QGPSKSVGGS VPSSSPGLRN GRKFTLQALP SLPPVTPPFS  1140
CS*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A1e-1666878912121Protein lin-54 homolog
5fd3_B1e-1666878912121Protein lin-54 homolog
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1647654LKKRKRKL
2648653KKRKRK
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00624PBMTransfer from PK22848.1Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.13e-45TESMIN/TSO1-like CXC 2