PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sopen01g042900.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family C2H2
Protein Properties Length: 423aa    MW: 47263.6 Da    PI: 8.7076
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sopen01g042900.1genomespennView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H216.72e-057092123
                      EEETTTTEEESSHHHHHHHHHHT CS
           zf-C2H2  1 ykCpdCgksFsrksnLkrHirtH 23
                      + C+ C+k F r  nL+ H r H
  Sopen01g042900.1 70 FICEVCNKGFQREQNLQLHRRGH 92
                      89*******************88 PP

2zf-C2H2130.0003146168123
                       EEETTTTEEESSHHHHHHHHHHT CS
           zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                       +kC++C+k +  +s+ k H +t+
  Sopen01g042900.1 146 WKCEKCSKKYAVQSDWKAHSKTC 168
                       58*****************9998 PP

3zf-C2H210.70.0017176194523
                       TTTEEESSHHHHHHHHHHT CS
           zf-C2H2   5 dCgksFsrksnLkrHirtH 23 
                       dCg+ Fsr++++++H   +
  Sopen01g042900.1 176 DCGTIFSRRDSFVTHRAFC 194
                       8**************8765 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:3.30.160.604.4E-66992IPR013087Zinc finger C2H2-type/integrase DNA-binding domain
SuperFamilySSF576671.6E-56992No hitNo description
PfamPF121711.9E-57092IPR022755Zinc finger, double-stranded RNA binding
PROSITE profilePS5015710.997092IPR007087Zinc finger, C2H2
SMARTSM003550.00527092IPR015880Zinc finger, C2H2-like
PROSITE patternPS0002807292IPR007087Zinc finger, C2H2
SMARTSM00355120111141IPR015880Zinc finger, C2H2-like
Gene3DG3DSA:3.30.160.609.7E-5134167IPR013087Zinc finger C2H2-type/integrase DNA-binding domain
SuperFamilySSF576671.6E-5141166No hitNo description
SMARTSM00355140146166IPR015880Zinc finger, C2H2-like
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003676Molecular Functionnucleic acid binding
GO:0046872Molecular Functionmetal ion binding
Sequence ? help Back to Top
Protein Sequence    Length: 423 aa     Download sequence    Send to blast
MSNSNLSSGN SSEEADETPY VLSSTSDGST AHQEHSQTHN KKRRKLPGNP DPSAEVIALS  60
PKTLMATNRF ICEVCNKGFQ REQNLQLHRR GHNLPWKLKQ KTSNEIKKRV YICPESSCIH  120
HNPSRALGDL TGIKKHFSRK HGEKKWKCEK CSKKYAVQSD WKAHSKTCGT KEYKCDCGTI  180
FSRRDSFVTH RAFCDALAEE NNKVNQVLAS TTQPLATGPE LISTTQMLNL PQIRNSNMKI  240
PSMPLNMAGS MFSSSSGFNQ LGTNSSNMSS ATALLQQAAQ MGATVSNNMN STLFNGVQVP  300
FQSNHNHDQN ETQIGSILQG FGGSMLQNNG DDHLKSSRVL QNEQGWFNNN NNSNTGLFNE  360
KQRILNKEAG HSNEESLTLD FLGIGGMRHR NLHEMHQQQQ EMSFEQQQVN HQSIQGVNSI  420
WDD
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3h_C6e-35142207368Zinc finger protein JACKDAW
5b3h_F6e-35142207368Zinc finger protein JACKDAW
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
14044KKRRK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9754400.0HG975440.1 Solanum pennellii chromosome ch01, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_015065552.10.0protein indeterminate-domain 12-like
TrEMBLA0A3Q7EN510.0A0A3Q7EN51_SOLLC; Uncharacterized protein
STRINGSolyc01g099340.2.10.0(Solanum lycopersicum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA10324350
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G03840.17e-98C2H2 family protein