PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sopim08g067150.0.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family CPP
Protein Properties Length: 989aa    MW: 106536 Da    PI: 6.6326
Description CPP family protein
Gene Model
Gene Model ID       Type    Source  Coding Sequence
Sopim08g067150.0.1  genome  CSHL    View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     48.7   1.5e-15  550    588  3          41
                 TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                         +k+CnCkk+kClk+YC+Cfaag++C ++C+C++C+N+ e
  Sopim08g067150.0.1 550 CKRCNCKKTKCLKLYCDCFAAGVYCVDSCTCQGCFNRPE 588
                         79**********************************876 PP

2    TCR     50.4   4.3e-16  637    675  1          39
                 TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                         ++k+gCnCkks ClkkYCeC++a++ Cs+ C+Ce+CkN 
  Sopim08g067150.0.1 637 RHKRGCNCKKSMCLKKYCECYQANVGCSSGCRCEGCKNV 675
                         589***********************************5 PP

Protein Features
3D Structure
Database         Entry ID  E-value  Start  End  InterPro ID  Description
SMART            SM01114   1.6E-15  548    589  IPR033467    Tesmin/TSO1-like CXC domain
PROSITE profile  PS51634   35.298   549    677  IPR005172    CRC domain
Pfam             PF03638   2.0E-11  551    586  IPR005172    CRC domain
SMART            SM01114   3.5E-17  637    678  IPR033467    Tesmin/TSO1-like CXC domain
Pfam             PF03638   1.4E-11  639    674  IPR005172    CRC domain
Sequence
Protein Sequence    Length: 989 aa
MDSPEPSSKI NATINTTLSS DPGPAQDSPV FNYISNLSPI QPVKGAPIVQ DFSGLNSPPL  60
LLTSPRINTH SRSSLLKRSQ FPKLSTEVFS GKNEDYNTAI TDSDGTGVFI SPLGSGLSPF  120
VQKVSDNNIS VHEQSGTPIS CVDEFLVDVS NSESGDSSNS NNKSPKVADS IPQPPDSAED  180
SKVPVVSIPS KDERKDEIPE DAARVVVEQA EEDNKGKSPS NQKYTGVYST SNPDLPSLGL  240
CAKIVPGLDA HSSLHNHYGD RQMAQLSRAG HTVLDEASNI PIKSLETAGD CRDDNDKIST  300
MSIVPDDGIL QHDSQTKASQ HQSGISRRCL QFEDAQQKMA PASSSSQNAS GIVSCSIQPV  360
SPAVIEVVEP VSSNRSSTTS NRRLTQLVSS SVNSESLNVK VSKPSGIGLH LNSIVNGMEA  420
GSGVTVSVKS TQRGNLSIRG KKLTSMMSCH PSKNLKNCLI SANVVGSNLT SDNDGIHESY  480
RSDAESAAAS LSHNNAKLLN DTVLLKPTEH TPSNKRKLNS EHIDSNMDYN QSSPQKKRKK  540
ISDGNDGDGC KRCNCKKTKC LKLYCDCFAA GVYCVDSCTC QGCFNRPEYE DTVLDVRQQI  600
QSRNPLAFAP KIVQHSTNSP ANILGEGVAS FTPSSARHKR GCNCKKSMCL KKYCECYQAN  660
VGCSSGCRCE GCKNVFGPKE EYGIDLVNKH CITESLERSV EEEVEMVTAT SGLLQSGPIN  720
QCNSTPLTPS FRRSNNVDAS KSWFTSGRYL SSPESGQADT APYGLSPGSP RSSNNHDTHQ  780
ETIGDMLDLV TFDHELSYGN AKLANEISPG FNVTGNMDDI LALPKSQDWA SNSGGQLIPQ  840
TVHFQSTDPL SWRNSPMTHM TQFDGSGMNA LELLDSDKKP YVLEDDTPEI LKDSSIPQIG  900
VKVNSPNKKR VSPPYRHLNE IGSSSSGGGL KTGRKFILRA VPSFPPLSPC IQSKNVAAHS  960
TDNSEKDTNM CYQKSEITDC NKLFRNTK*
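
The Start and End columns reported for the signature domains can be checked against the sequence block above. The sketch below is illustrative only: it assumes the domain coordinates are 1-based and inclusive, and the helper names (`clean_sequence_block`, `slice_inclusive`) are hypothetical, not part of any PlantTFDB tool. It cleans one printed row (residues 541-600) and extracts positions 550-588, the first TCR hit.

```python
import re


def clean_sequence_block(text: str) -> str:
    """Drop spacing, running position counts, and a trailing '*' stop marker."""
    residues = re.sub(r"[^A-Za-z*]", "", text)
    return residues.rstrip("*").upper()


def slice_inclusive(seq: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    first_pos is the protein position of seq[0], so a single printed row
    can be sliced without loading the whole sequence.
    """
    return seq[start - first_pos : end - first_pos + 1]


# Row covering residues 541-600, copied from the sequence block.
ROW_541_600 = "ISDGNDGDGC KRCNCKKTKC LKLYCDCFAA GVYCVDSCTC QGCFNRPEYE DTVLDVRQQI  600"

fragment = clean_sequence_block(ROW_541_600)
tcr1 = slice_inclusive(fragment, 550, 588, first_pos=541)
print(tcr1)
```

Under these assumptions the extracted string should match the query line of the first TCR alignment in the Signature Domain section.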
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5fd3_A  1e-22    551          674        12         120      Protein lin-54 homolog
5fd3_B  1e-22    551          674        12         120      Protein lin-54 homolog
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    535    539  KKRKK
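
The NLS coordinates can be located in the protein sequence with a plain substring search. The sketch below is a minimal check, not PlantTFDB's own method: it uses only the printed row for residues 481-540 and assumes that row starts at position 481. The variable names are illustrative.

```python
MOTIF = "KKRKK"

# Row covering residues 481-540, copied from the protein sequence block.
ROW_481_540 = "RSDAESAAAS LSHNNAKLLN DTVLLKPTEH TPSNKRKLNS EHIDSNMDYN QSSPQKKRKK  540"
ROW_FIRST_POS = 481  # assumed 1-based position of the first residue in the row

# Keep letters only, which drops the spacing and the trailing position count.
residues = "".join(ch for ch in ROW_481_540 if ch.isalpha())

idx = residues.find(MOTIF)            # 0-based offset within the row
start_1based = ROW_FIRST_POS + idx    # 1-based start in the full protein
end_1based = start_1based + len(MOTIF) - 1

print(start_1based, end_1based)
```

With 1-based numbering the motif spans 536-540, one position past the 535-539 listed in the table. One possible reading is that the NLS table uses 0-based coordinates while the domain tables use 1-based coordinates, but the source does not state this.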
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975520  0.0      HG975520.1 Solanum lycopersicum chromosome ch08, complete genome.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_004245198.2      0.0      protein tesmin/TSO1-like CXC 4 isoform X1
TrEMBL  A0A3Q7HP37          0.0      A0A3Q7HP37_SOLLC; Uncharacterized protein
STRING  Solyc08g067150.2.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA6980              21           28
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  8e-63    TESMIN/TSO1-like CXC 2