PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID OMO66950
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus
Family C2H2
Protein Properties Length: 825aa    MW: 90882.2 Da    PI: 9.3511
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
OMO66950genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H215.45.4e-057395123
              EEETTTTEEESSHHHHHHHHHHT CS
   zf-C2H2  1 ykCpdCgksFsrksnLkrHirtH 23
              + C++C+k F r  nL+ H r H
  OMO66950 73 FICEICNKGFQRDQNLQLHRRGH 95
              89******************988 PP

2zf-C2H211.70.00079150172123
               EEETTTTEEESSHHHHHHHHHHT CS
   zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
               +kC++C+k++  +s+ k H + +
  OMO66950 150 WKCEKCSKRYAVQSDWKAHSKIC 172
               58*****************9876 PP

Sequence ? help Back to Top
Protein Sequence    Length: 825 aa     Download sequence    
MMKGLVMDES MSNLTSTSGE ISASSSATRI ETTFASTNQA PPLKKKRSLP GNPDPDAEVI  60
ALSPKSLLAT NRFICEICNK GFQRDQNLQL HRRGHNLPWK LKQRPKMEVI RKKVYVCPET  120
TCVHHDPSRA LGDLTGIKKH FSRKHGEKKW KCEKCSKRYA VQSDWKAHSK ICGTKEYRCD  180
CGTLFSRRDS FITHRAFCDA LAEESARAIP TLPNSLLSSS QQQVEIAGSS VSHNLSTLQQ  240
QQPQVFHSQA LQALAVKREQ DITYLLSGRS VAAADHSLPP WLACSSSFLD PLSQNIHENP  300
SQTQNPSSTT TTLAPFQTPT AASSPISPHM SATALLQKAA QMGVTMSSNK PLQSPAPVAM  360
QRPHHMSGTA GFIGSTSNPA GSAVGLSARD HGLAFFGNKA AAMEQVVAAT NSTAGAPSLL  420
HDMMSSLSST SGFDGSSSSF EQSFNGIFNP KLLGNNSNNF QEIHQQNFQK TAESQLSRSE  480
NNHERRGSSV SSNIIGGNSN NDGLTRDFLG LKAFPHRDFP NLAGFNHHGH GINSSPAYAQ  540
HNQHSHQSQT PWQVGQQRIV IRNQHGENLV GILHETGSKD VVIICHGFQS IKERIPMVSI  600
ANVLERQGIS AFRFDFAGNG ESEGSFMYGN YRREAEDLRA VVQHFCNKDR PVTAIIGHSK  660
GGNVVLLYAS KYNDVPTVVS ISGRFNLEKG MEGRLGKDFL QRIKQNGFID VFNRKGKFEY  720
RVTQESLMDR LSTDTRAACL LIDQNCRVLI IHGSMDRIVP AKDALEFARF IRNHKLHIVE  780
GADHEYTSHQ DELTKIVLDF VREGRQEKNT TQDSQSCSGG VRSRI
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1100113KLKQRPKMEVIRKK
2102115KLKQRPKMEVIRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G55110.11e-109C2H2 family protein