PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Mapoly0023s0118.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Marchantiophyta; Marchantiopsida; Marchantiidae; Marchantiales; Marchantiaceae; Marchantia
Family CPP
Protein Properties Length: 1182 aa    MW: 127671 Da    pI: 6.8097
Description CPP family protein
Gene Model
Gene Model ID        Type    Source
Mapoly0023s0118.1.p  genome  JGI
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     48.9   1.3e-15  700    739  3          42
                  TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkeek 42 
                          +k+CnCkkskClk+YCeCfaag++C  +C+C+dC+Nk e+
  Mapoly0023s0118.1.p 700 CKRCNCKKSKCLKLYCECFAAGTYCVGSCTCHDCFNKPEH 739
                          89**********************************9876 PP

2    TCR     50.9   3e-16    784    822  1          39
                  TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                          ++k+gCnCkks ClkkYCeC++ag+ Cse C+Ce+CkN 
  Mapoly0023s0118.1.p 784 RHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCKNM 822
                          589***********************************7 PP
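
The Start and End values reported for each domain appear to be 1-based, inclusive positions in the protein sequence: residues 700 to 739 reproduce the first aligned segment exactly. The following is a minimal Python sketch under that assumption. The function name `extract_region` and its `first_pos` parameter are hypothetical names introduced for illustration, and the fragment is residues 661 to 740 copied from the protein sequence in this record.

```python
def extract_region(fragment: str, start: int, end: int, first_pos: int = 1) -> str:
    """Slice a 1-based, inclusive [start, end] region out of a sequence fragment.

    first_pos is the position of fragment[0] within the full-length protein
    (an assumption of this sketch, not something the database defines).
    """
    if start < first_pos or end < start or end - first_pos >= len(fragment):
        raise ValueError("region lies outside the fragment")
    return fragment[start - first_pos : end - first_pos + 1]


# Residues 661-740 of Mapoly0023s0118.1.p, copied from this record.
FRAGMENT_661_740 = (
    "HHDALQQSNLDYAEEYDSPRSPKKKRKSPTSAGEKSGEGC"
    "KRCNCKKSKCLKLYCECFAAGTYCVGSCTCHDCFNKPEHE"
)

# First TCR domain hit: Start 700, End 739.
domain_1 = extract_region(FRAGMENT_661_740, 700, 739, first_pos=661)
print(domain_1)
```

The same helper can be pointed at the full-length sequence with `first_pos=1` to recover the second hit (784 to 822).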

Protein Features
3D Structure
Database         Entry ID  E-value  Start  End  InterPro ID  Description
SMART            SM01114   7.2E-15  698    739  IPR033467    Tesmin/TSO1-like CXC domain
PROSITE profile  PS51634   36.129   699    824  IPR005172    CRC domain
Pfam             PF03638   4.8E-11  701    736  IPR005172    CRC domain
SMART            SM01114   1.6E-18  784    825  IPR033467    Tesmin/TSO1-like CXC domain
Pfam             PF03638   8.9E-12  786    822  IPR005172    CRC domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0009934  Biological Process  regulation of meristem structural organization
GO:0048444  Biological Process  floral organ morphogenesis
GO:0051302  Biological Process  regulation of cell division
GO:0005634  Cellular Component  nucleus
GO:0044212  Molecular Function  transcription regulatory region DNA binding
Sequence
Protein Sequence    Length: 1182 aa
MESPDRKDNI PAPVDQGSPF FKYLCNLSPI KSTRPRHVVQ TYAELSMPPP PAVFTSPPNP  60
NKCGGRVRRG LGVSSFVSHS SWSVDKENLP NHSYVSRSYT AVQSESSHAS SHDALAYVSH  120
QSTGIKQEVV DKNEDAPSKY EYSDVSDYLT EPVESGLYNT SKAYFMVKTS TVCEEFEFPG  180
EDHVHVASQD EPSSSQLTNP PSANEENSHD VSGITVLPQA GSDDNLFQSG DVCSKETEEV  240
IGCRPSCESN DERNGDGSET RSSDDSLQLD QETAAMAFLL VGEVDGEGGS GSDWSAAHQA  300
ALPEFSQQQS PDFSAVERIS DGVHPRGREE FDGGASNGLV ENRLAVEELR GTGNDDNRNE  360
LVGGPFGQRG VRRRCLDFEN ARRKSMGGGV SRKSLGQRPK SVVGPSELTT SRNVPPSVLS  420
ERAAATSDAT NSCPSVGSTE EPQSNTPGQP RPILSGTNLK SPGLLRLTSF QRDISSTEKL  480
VVETTRTVIE SSTNRTAPSG IGLHLNSLTG NTGLHLNSLT GSISFSPNNR DVSQSLGIGS  540
ASKGAVASLL GISGMSGNPN EIRCEGFTTS MAGIGPGTKC FAMNSSTSLY TSGVSVPFLE  600
RNRMPSRDTT PIMDMAREIA KSNDIHPSTG GSRQSHDESR SLGSDVLDKK ACPIGKKRLL  660
HHDALQQSNL DYAEEYDSPR SPKKKRKSPT SAGEKSGEGC KRCNCKKSKC LKLYCECFAA  720
GTYCVGSCTC HDCFNKPEHE DTVLATRQQI ESRNPLAFAP KIIHAADSSP KRGEEVIDTP  780
ASARHKRGCN CKKSLCLKKY CECYQAGVGC SEGCRCEGCK NMYGRKEANE EADEKEILLE  840
ALEKESQEDF DDLSKPEHAS ALILEQHRHM GKDLSPITPS VQYTGQGRAI GRTRSLGKKR  900
SSSPLSQPCG GLLQRSPTDS SQSPLNLCGD GNFSAHQNPE FQLSKIERSP NSTPKFTRIS  960
QLSPRWERLG DICTLTPLPQ APLRPTPASI STLERTGASP CFNRQSMEAV SSYQSDLADF  1020
TSLGKPSSFQ LPPDDSPPSP SNRRMSQSTS RQTPCTPSLS SSAFTGKEEM KFGDSRSGQF  1080
ACYSQDDDDT PDFLRYSEDA HSPLISKTSS PKQKRVSPPH FEGLREKCRG DSRNIFDSPA  1140
RSSPGLRSSR KFILTAISGL PSPHSMGPTL QGHIGNSQKK G*
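
The sequence is displayed in groups of ten residues with a running position counter at the end of each line and a trailing `*`. Before downstream use it usually needs to be flattened into a plain residue string. The following is a minimal Python sketch of one way to do that. The function name `parse_sequence_block` is a hypothetical helper, not part of any PlantTFDB tooling, and the sketch assumes that every letter in the block is a residue and everything else is layout.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Flatten a display-formatted protein sequence into a plain residue string.

    Assumes every letter is a residue; spaces, position counters, and the
    trailing '*' are treated as layout and dropped.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


# First two display lines of the sequence above.
example = (
    "MESPDRKDNI PAPVDQGSPF FKYLCNLSPI KSTRPRHVVQ TYAELSMPPP PAVFTSPPNP  60\n"
    "NKCGGRVRRG LGVSSFVSHS SWSVDKENLP NHSYVSRSYT AVQSESSHAS SHDALAYVSH  120\n"
)

seq = parse_sequence_block(example)
print(len(seq), seq[:10])
```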
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5fd3_A  9e-19   703          839        14         131      Protein lin-54 homolog
5fd3_B  9e-19   703          839        14         131      Protein lin-54 homolog
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    678    686  PRSPKKKRK
2    681    687  PKKKRKS
Binding Motif
Motif ID  Method  Source
MP00624   PBM     Transfer from PK22848.1
Annotation -- Protein
Source  Hit ID      E-value  Description
TrEMBL  A0A2R6XCG8  0.0      A0A2R6XCG8_MARPO; Uncharacterized protein
Orthologous Group
Lineage               Orthologous Group ID  Taxa Number  Gene Number
Representative plant  OGRP993               17           55
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  8e-66    TESMIN/TSO1-like CXC 2