PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sme2.5_04162.1_g00001.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family C3H
Protein Properties Length: 958aa    MW: 103803 Da    PI: 6.6774
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sme2.5_04162.1_g00001.1genomeEGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH21.63.7e-07681705126
                              --S---SGGGGTS--TTTTT-SS-SS CS
                  zf-CCCH   1 yktelCrffartGtCkyGdrCkFaHg 26 
                              +kt+lC+ f++ G C+y  +C+FaHg
  Sme2.5_04162.1_g00001.1 681 FKTKLCCKFRA-GVCPYITNCNFAHG 705
                              69*********.*************8 PP

2zf-CCCH41.32.6e-13816841126
                              --S---SGGGGTS--TTTTT-SS-SS CS
                  zf-CCCH   1 yktelCrffartGtCkyGdrCkFaHg 26 
                              +kt++C+ +  tG+C++G++C+FaHg
  Sme2.5_04162.1_g00001.1 816 WKTRICNKWEMTGYCPFGNKCHFAHG 841
                              8************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF536131.24E-19155296IPR029056Ribokinase-like
CDDcd011689.59E-81157539No hitNo description
Gene3DG3DSA:3.40.1190.208.7E-52197279IPR029056Ribokinase-like
PfamPF002943.1E-11198286IPR011611Carbohydrate kinase PfkB
SuperFamilySSF536137.48E-34347538IPR029056Ribokinase-like
PfamPF002942.7E-21364538IPR011611Carbohydrate kinase PfkB
Gene3DG3DSA:3.40.1190.208.7E-52367536IPR029056Ribokinase-like
PROSITE patternPS005840493506IPR002173Carbohydrate/puine kinase, PfkB, conserved site
SuperFamilySSF902292.35E-7678711IPR000571Zinc finger, CCCH-type
Gene3DG3DSA:4.10.1000.109.3E-14678713IPR000571Zinc finger, CCCH-type
SMARTSM003560.014680706IPR000571Zinc finger, CCCH-type
PROSITE profilePS5010312.418680707IPR000571Zinc finger, CCCH-type
PfamPF006422.2E-4681705IPR000571Zinc finger, CCCH-type
SuperFamilySSF902292.75E-6755783IPR000571Zinc finger, CCCH-type
Gene3DG3DSA:4.10.1000.101.8E-8755784IPR000571Zinc finger, CCCH-type
SMARTSM003560.12756783IPR000571Zinc finger, CCCH-type
PROSITE profilePS5010313.056756784IPR000571Zinc finger, CCCH-type
SuperFamilySSF902295.49E-9812843IPR000571Zinc finger, CCCH-type
Gene3DG3DSA:4.10.1000.103.0E-15812842IPR000571Zinc finger, CCCH-type
SMARTSM003568.8E-9815842IPR000571Zinc finger, CCCH-type
PROSITE profilePS5010315.225815843IPR000571Zinc finger, CCCH-type
PfamPF006423.3E-10816841IPR000571Zinc finger, CCCH-type
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0015979Biological Processphotosynthesis
GO:0009507Cellular Componentchloroplast
GO:0016773Molecular Functionphosphotransferase activity, alcohol group as acceptor
GO:0046872Molecular Functionmetal ion binding
Sequence ? help Back to Top
Protein Sequence    Length: 958 aa     Download sequence    Send to blast
MSFSLLSLSS SSSSAVSRTF PSFFSSETSS PPSTLLTLLR NQSNFSHVGS SNRIFGSSSC  60
SFFSSGCSLS AADYVVPRAF SSSFESEFSS SRRNSEGESS DIGSGENGEQ GAETDDEDQG  120
HVLSEIKFRH IMSNHLPQYF FGLPLPLCVA PKTSLYVDFS GMVDDEFLER LGLEKGTRKV  180
VNHEERGKVL SAMDGCSYKA AAGGSLSNSL VALARLGGQT TAGPALNVAL AGSVGSDPLG  240
GFYRSKLRRA NVNFLSAPVK DGTTGTVIVL TTSDAQRTML AYQYAQLVPG ADDPWFKGKS  300
SLINSSDEPF SFCFLVLTCN DPVKVGSSLY RFFSYFKSTR PSVIAAFCFG MGTSSRINYD  360
PCLADAITKT NILVVEGYLF ELPDTVRTIS KVCEKARKCG ALVAITASDV SCIERHYDDY  420
WEIMANYADI VFANSAEARA FCHFSSKESP LSATRYLSHF VPLVSVTDGP KGSYIGVKGE  480
AIYIPPFPCM PVDTCGAGDA YASGILYGIS RGVSDLKSIG SIAAQVASVV VGQQGTRLRV  540
QDAIRLAESF SVHCRNSTIW SDIGSDQISS FIFFLMILSI FGWEFWFFMD FMGEDSVAGG  600
SDVSGFDNWG PGITDQAVWA TEDDYRAWNT GLSSETPSNS SQDGRHSQNR STSEPPHKKS  660
RNSQGVDSVS NRSKAIGKMF FKTKLCCKFR AGVCPYITNC NFAHGIEELR KPPPNWQDIV  720
AAHESERGGG VMLEPREEHQ IPTASSPELR AESQRSYKGR HCKKFHTEEG CPYGDACTFL  780
HAEQSRARES VAISVTPTVG GFGNNAAGAN LKPSNWKTRI CNKWEMTGYC PFGNKCHFAH  840
GAAGLAPNSI WIFDLVLISA IFLAELHKYG GGLVEMEGTD SLSTPPDTKQ GGGPFKTAES  900
TVPSTISAPR ADVYHLGSGV HVQRPSGIVQ RTGQRVLHKW KGPEKVSKIY GDWIDDIE
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
3ubo_A3e-13365537147317adenosine kinase
3ubo_B3e-13365537147317adenosine kinase
4kad_A4e-13364537166337Probable sugar kinase protein
4kad_B4e-13364537166337Probable sugar kinase protein
4kah_A4e-13364537166337Probable sugar kinase protein
4kah_B4e-13364537166337Probable sugar kinase protein
4kal_A4e-13364537166337Probable sugar kinase protein
4kal_B4e-13364537166337Probable sugar kinase protein
4kan_A4e-13364537166337Probable sugar kinase protein
4kan_B4e-13364537166337Probable sugar kinase protein
4kbe_A4e-13364537166337Probable sugar kinase protein
4kbe_B4e-13364537166337Probable sugar kinase protein
4lbg_A4e-13364537166337Probable sugar kinase protein
4lbg_B4e-13364537166337Probable sugar kinase protein
4lbx_A4e-13364537166337Probable sugar kinase protein
4lbx_B4e-13364537166337Probable sugar kinase protein
4lc4_A4e-13364537166337Probable sugar kinase protein
4lc4_B4e-13364537166337Probable sugar kinase protein
4lca_A4e-13364537166337Probable sugar kinase protein
4lca_B4e-13364537166337Probable sugar kinase protein
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9754421e-162HG975442.1 Solanum pennellii chromosome ch03, complete genome.
GenBankHG9755151e-162HG975515.1 Solanum lycopersicum chromosome ch03, complete genome.
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA37292343
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G32360.11e-113C3H family protein