PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID MA_33846g0010
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Acrogymnospermae; Pinidae; Pinales; Pinaceae; Picea
Family CPP
Protein Properties Length: 1117aa    MW: 120795 Da    PI: 6.2254
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
MA_33846g0010genomeConGenIEView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR48.71.5e-15649687341
            TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                    +k+CnCkkskClk+YCeCfaag++C e C+C++C+Nk e
  MA_33846g0010 649 CKRCNCKKSKCLKLYCECFAAGVYCVEPCTCQECFNKPE 687
                    89**********************************976 PP

2TCR51.22.6e-16733771139
            TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                    ++k+gCnCkks ClkkYCeC++ag+ Cs+ C+Ce+CkN 
  MA_33846g0010 733 RHKRGCNCKKSMCLKKYCECYQAGVGCSDGCRCEGCKNV 771
                    589***********************************6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011144.5E-16647688IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163435.639648773IPR005172CRC domain
PfamPF036381.9E-11650685IPR005172CRC domain
SMARTSM011143.1E-18733774IPR033467Tesmin/TSO1-like CXC domain
PfamPF036386.0E-12735770IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 1117 aa     Download sequence    Send to blast
MEMDSPDRNL NTPSSAVQGS PFFNYLCNLS PIKQVKSVHV AQTFSELNFP PPPAVFTSPR  60
VPSQKESNFL KRSYQKDTRD EENKSDDTPC EKVKDFAVPM SDCDVPASLN PLARQIKNFN  120
SLASDQTKCE DNTMSTFASC SPSKLLEEYL ADPAEEENYT VDSSKNYLKS TSPGLVNSLV  180
VDSDKQQETV NQCSKMKDTE MIPQPNFPEE FLGTGKIATE AETSQGDEEG TFSLNSEGSM  240
KVSAVREMGH ADQNDTEGLS SVWSDVDGHH SNIMFANNCE SSETGASHDE NLMDQTAGTF  300
AFILSKGSTC DIEEWQKASS ADCSAGDPNM KLDTEQNRVV EHQTSFGCKP GNQSQRGIRR  360
RCLDFEASEA QRKTMSNRSW NSTSLMPKTD VLVNSAHDSD TNTSSSVSCE KGVTSDCKQI  420
VPLKRETGAD IGRISQSSAN VRFSSFSSKS TELTSDKSDS NIRNNGNTPI SVGIPSGIGL  480
HLNSLATTMP LNSGVNMLTS AKGSVSTQGM SPSSGVKETS SDVVGSSLPS VNSGGNISVG  540
TNSNIASGMV VSMVEKTRID QQELQSSGMT QIGVTRSTTC FRSVTVGIKP LQSRLPFKSE  600
ERNLSPLGKK RPALLDISQQ SPFGIGEEFS QSSPKKKRKS TTIGDKEGCK RCNCKKSKCL  660
KLYCECFAAG VYCVEPCTCQ ECFNKPEYED MVLGTRQQIE SRNPLAFAPK IVRGADSPPT  720
NGDECGETPS SARHKRGCNC KKSMCLKKYC ECYQAGVGCS DGCRCEGCKN VYGKKEGGSD  780
DIEENETPIE GWDKDSLEEK TEVLDVENDI LLSEQQHAKD LSPLTPSFQY SGQGKSSAKL  840
NSCGKKHFTS EDVESPTVSQ SSAKPPRSPG KILRPTKGLQ GNITAIHNRQ AGSRTSASPI  900
FTPKMDKSGQ FSPQWDCLAD ICTLTPMLHP PMRPSATSAS NVDGIDVSPF SGQHNEISSM  960
TSRPLASRHS CHVGSSLGFR QPAARSPICT SDNIHWQTPV NAKVTPVTPA LSTTACTAGN  1020
KLSDGSDFDL QSHDVSGSLE DDTPDILKNN CSPTRGLKAS SPNQKRVSPP HNYCPKEMMN  1080
RRPISSPGIR SGRKYILQSV PSFPPLTPLS GEGQANE
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A1e-1865277114121Protein lin-54 homolog
5fd3_B1e-1865277114121Protein lin-54 homolog
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1633639PKKKRKS
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00624PBMTransfer from PK22848.1Download
Motif logo
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankBT1238130.0BT123813.1 Picea sitchensis clone WS04717_J09 unknown mRNA.
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
Representative plantOGRP9931755
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.12e-73TESMIN/TSO1-like CXC 2