PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG029045t1
Common Name TCM_029045
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family C2H2
Protein Properties Length: 1272 aa    MW: 140740 Da    pI: 7.7763
Description C2H2 family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG029045t1  genome  CGD     View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-C2H2  15.8   4.1e-05  88     110  1          23
                       EEETTTTEEESSHHHHHHHHHHT CS
           zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                       ++C++C+k F r  nL+ H r H
  Thecc1EG029045t1  88 FVCEICNKGFQRDQNLQLHRRGH 110
                       89******************988 PP

2    zf-C2H2  11.3   0.0011   164    186  1          23
                       EEETTTTEEESSHHHHHHHHHHT CS
           zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                       +kC +C+k++  +s+ k H +t+
  Thecc1EG029045t1 164 WKCDKCSKRYAVQSDWKAHSKTC 186
                       58*****************9998 PP
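
The two zf-C2H2 hits above can be sanity-checked against the classical C2H2 consensus. The following is a minimal sketch and not part of the PlantTFDB record: it assumes the PROSITE PS00028 pattern is C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H and writes it as a Python regular expression. The names `C2H2_PATTERN`, `DOMAINS`, and `find_c2h2` are hypothetical.

```python
import re

# Assumed PROSITE PS00028 consensus, written as a regular expression:
# C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H
C2H2_PATTERN = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")

# Domain segments copied from the alignments above, keyed by start position.
DOMAINS = {
    88: "FVCEICNKGFQRDQNLQLHRRGH",
    164: "WKCDKCSKRYAVQSDWKAHSKTC",
}


def find_c2h2(segment, offset):
    """Return the 1-based (start, end) of the first pattern match, or None."""
    match = C2H2_PATTERN.search(segment)
    if match is None:
        return None
    # match.end() is exclusive, so subtract 1 for an inclusive end coordinate.
    return (offset + match.start(), offset + match.end() - 1)


for start, segment in DOMAINS.items():
    print(start, find_c2h2(segment, start))
```

Under that assumed pattern, only the first finger matches, at positions 90 to 110, which agrees with the single PROSITE pattern PS00028 row listed under Protein Features. The second segment ends in a cysteine rather than a histidine, so the strict pattern does not match it even though the HMM reports a hit.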

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End   InterPro ID  Description
Gene3D           G3DSA:3.30.160.60   4.8E-5    87     110   IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
SuperFamily      SSF57667            1.63E-6   87     110   No hit       No description
PROSITE profile  PS50157             10.949    88     110   IPR007087    Zinc finger, C2H2
SMART            SM00355             0.017     88     110   IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028             0         90     110   IPR007087    Zinc finger, C2H2
SMART            SM00355             210       129    159   IPR015880    Zinc finger, C2H2-like
Gene3D           G3DSA:3.30.160.60   7.4E-4    152    185   IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
SuperFamily      SSF57667            1.63E-6   159    184   No hit       No description
SMART            SM00355             140       164    184   IPR015880    Zinc finger, C2H2-like
Gene3D           G3DSA:3.20.20.80    1.1E-114  631    968   IPR013781    Glycoside hydrolase, catalytic domain
SuperFamily      SSF51445            8.75E-97  634    966   IPR017853    Glycoside hydrolase superfamily
Pfam             PF01301             2.3E-114  640    964   IPR031330    Glycoside hydrolase 35, catalytic domain
PRINTS           PR00742             1.3E-43   643    660   IPR001944    Glycoside hydrolase, family 35
PRINTS           PR00742             1.3E-43   664    682   IPR001944    Glycoside hydrolase, family 35
PRINTS           PR00742             1.3E-43   719    738   IPR001944    Glycoside hydrolase, family 35
PRINTS           PR00742             1.3E-43   775    790   IPR001944    Glycoside hydrolase, family 35
PROSITE pattern  PS01182             0         777    789   IPR019801    Glycoside hydrolase, family 35, conserved site
PRINTS           PR00742             1.3E-43   868    883   IPR001944    Glycoside hydrolase, family 35
PRINTS           PR00742             1.3E-43   933    949   IPR001944    Glycoside hydrolase, family 35
SuperFamily      SSF49785            7.23E-33  1094   1135  IPR008979    Galactose-binding domain-like
Gene3D           G3DSA:2.60.120.260  6.0E-34   1097   1253  IPR008979    Galactose-binding domain-like
SuperFamily      SSF49785            7.23E-33  1168   1263  IPR008979    Galactose-binding domain-like
Pfam             PF13364             8.9E-6    1169   1239  IPR025300    Beta-galactosidase jelly roll domain
PRINTS           PR00742             1.3E-43   1200   1216  IPR001944    Glycoside hydrolase, family 35
Gene Ontology
GO Term     GO Category         GO Description
GO:0005975  Biological Process  carbohydrate metabolic process
GO:0016021  Cellular Component  integral component of membrane
GO:0003676  Molecular Function  nucleic acid binding
GO:0004565  Molecular Function  beta-galactosidase activity
GO:0046872  Molecular Function  metal ion binding
Sequence
Protein Sequence    Length: 1272 aa
MMKGLLFHDQ QQQVLEENMS NLTSASGEAS VSSGNRAEAA TNYPQQYFST PPPETQPAKK  60
KRNLPGNPDP DAEVIALSPK TLMATNRFVC EICNKGFQRD QNLQLHRRGH NLPWKLKQRT  120
SKEVRKKVYV CPEPSCVHHD PSRALGDLTG IKKHFCRKHG EKKWKCDKCS KRYAVQSDWK  180
AHSKTCGTRE YRCDCGTLFS RRDSFITHRA FCDALAEESA RAITGANPLL SSHQPGASAS  240
HINLQVPQFN AQDIQAFSLK KEQQSFSLRP EIPPWLSSQP MLGAGPGPPP QPIDLSSSSS  300
SIFSARLDHH HQEFTQTTHH QDLTHHVNPN PNPTSLGPTL PAYHPTTVPS PHMSATALLQ  360
KAAQMGATMS SKTGSSSAPA TAAAASLIRP HQQAHVSADS AGSNNNTTTA VFGLNLSSRE  420
ELAVIMLAQT KDGRFINETF SSTTTTPTTT TNAAAAARND HETGGIQGEG LTRDFLGLRA  480
FSHSDILNLA GLSNCMNTSH EQRNQSQKPW QVVDSPKFEA VHIALLAAPS AGTGSGYFYL  540
PGTTDSLKAE EMNHSKKLAA TESTSKSTAA MARKRSSKTT LIFFVLLSIV AFVAFVPVFA  600
SLPSLSSHSH DLHLHLRLHQ RQHRLEKSDA RKFEIAEDMF WKDGKPFQII GGDLHYFRIL  660
PEYWEDRLLR AKALGLNTIQ TYIPWNLHEP EPGKLVFEGI ADLVSFLKLC QKLGLLVMLR  720
AGPYICAEWD LGGFPAWLLA IEPDIRLRSS DPAYLQLVEG WWGVLLPKVA PLLYGNGGPI  780
IMVQIENEFG SYGDDKAYLR HLVKLARGHL GEDIILYTTD GGSRETLEKG TLVGDDVFSA  840
VDFTTGDDPW PIFELQKEFN SPGKSPPLSS EFYTGWLTHW GEKIARTDAD FTAAALEKIL  900
SRNGSVVLYM AHGGTNFGFY NGANTGADES DYKPDLTSYD YDAPITESGD VDNAKFKAIR  960
RVVGKYSSVS LPSFPSSNKK TGYGFIQLQK TRSLFDLLDG FDSAHIVEAE NPTAMEYFYQ  1020
MFGFLLYVSE YASKAGGNKL FIPKVHDRAQ VFISCPSRAD GGRVSYVGTI ERWSNQAIYL  1080
PNAKCVSNTS LFILVENMGR VNYGPYLFDR KGILSSVYVD GRVLNRWKMI PIPFQNLNEV  1140
PKFNPVIQVA SEFPKVSIRK KLEHKSEDVL EGPSFYTGHF SIDKTSEVTD TFISFRAWGK  1200
GIAFVNEFNI GRYWPTSGPQ CNLYIPAPIL RHGENVLVIF ELESPNPELV VDSVDQQDFN  1260
CGSSKASVRQ L*
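
The Start and End columns in the tables above index into this sequence, so it can help to strip the layout and slice by coordinate. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and it uses just the first four lines of the block. The helper names `clean_sequence` and `subsequence` are hypothetical.

```python
# First four lines of the sequence block above, copied verbatim,
# including the grouping spaces and trailing position counters.
SEQUENCE_BLOCK = """\
MMKGLLFHDQ QQQVLEENMS NLTSASGEAS VSSGNRAEAA TNYPQQYFST PPPETQPAKK  60
KRNLPGNPDP DAEVIALSPK TLMATNRFVC EICNKGFQRD QNLQLHRRGH NLPWKLKQRT  120
SKEVRKKVYV CPEPSCVHHD PSRALGDLTG IKKHFCRKHG EKKWKCDKCS KRYAVQSDWK  180
AHSKTCGTRE YRCDCGTLFS RRDSFITHRA FCDALAEESA RAITGANPLL SSHQPGASAS  240
"""


def clean_sequence(block):
    """Keep only residue letters: drops spaces, newlines, counters, and '*'."""
    return "".join(ch for ch in block if ch.isalpha())


def subsequence(seq, start, end):
    """Slice using 1-based, inclusive coordinates (an assumption about the tables)."""
    return seq[start - 1:end]


seq = clean_sequence(SEQUENCE_BLOCK)
print(len(seq))                    # 240 residues in this excerpt
print(subsequence(seq, 88, 110))   # FVCEICNKGFQRDQNLQLHRRGH
print(subsequence(seq, 164, 186))  # WKCDKCSKRYAVQSDWKAHSKTC
```

Both slices reproduce the query lines shown in the Signature Domain alignments, which supports the 1-based, inclusive reading for that table.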
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
4mad_A  1e-147   637          1242       21         578      Beta-galactosidase
4mad_B  1e-147   637          1242       21         578      Beta-galactosidase
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    114    126  KLKQRTSKEVRKK
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AM465183  1e-138   AM465183.2 Vitis vinifera contig VV78X062580.11, whole genome shotgun sequence.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_017978893.1  0.0      PREDICTED: beta-galactosidase 17
Swissprot  Q93Z24          0.0      BGA17_ARATH; Beta-galactosidase 17
TrEMBL     A0A061GB64      0.0      A0A061GB64_THECC; Beta-galactosidase
STRING     EOY27120        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM807722
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G55110.1  1e-116   indeterminate(ID)-domain 7
Publications
  1. Iglesias N, et al.
    Apoplastic glycosidases active against xyloglucan oligosaccharides of Arabidopsis thaliana.
    Plant Cell Physiol., 2006. 47(1): p. 55-63
    [PMID:16267099]
  2. Ahn YO, et al.
    Functional genomic analysis of Arabidopsis thaliana glycoside hydrolase family 35.
    Phytochemistry, 2007. 68(11): p. 1510-20
    [PMID:17466346]
  3. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]
  4. Chandrasekar B, van der Hoorn RA
    Beta galactosidases in Arabidopsis and tomato - a mini review.
    Biochem. Soc. Trans., 2016. 44(1): p. 150-8
    [PMID:26862200]