PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG007171t2
Common Name TCM_007171
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HB-PHD
Protein Properties Length: 951aa    MW: 105016 Da    PI: 4.6836
Description HB-PHD family protein
Gene Model
Gene Model ID       Type      Source    Coding Sequence
Thecc1EG007171t2    genome    CGD       View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      Homeobox    37.4     4.3e-12    863      903    12           52
                       HHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHH CS
          Homeobox  12 leeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrR 52 
                        ++L + F++n+yp++++++ LAk+l++t +qV+ WF N R
  Thecc1EG007171t2 863 KQRLYKSFKENQYPDRATKQSLAKELDMTFQQVSKWFDNAR 903
                       689************************************99 PP

2      PHD         39.4     1.4e-13    492      547    1            51
                       SBTTTSS..TCTTSSEEEBSS.SSSEEETTTSTSSSSHHSHHSS..TBSSHHHHTT CS
               PHD   1 rCkvCgk..sdeegelvlCdg.CkewfHlkClglkleseekpeg..ewlCeeCkek 51 
                       +C+ Cg+    ++++++lCdg C++ fH++Cl+++l +e++p +   wlC+ C++k
  Thecc1EG007171t2 492 FCAKCGSkdLSANNDIILCDGaCDRGFHQYCLQPPLLKEDIPPDdeGWLCPGCDCK 547
                       699***977677********66*******************99999*******985 PP

Protein Features
3D Structure
Database           Entry ID            E-value     Start    End    InterPro ID    Description
SuperFamily        SSF57903            1.86E-14    482      550    IPR011011      Zinc finger, FYVE/PHD-type
Gene3D             G3DSA:3.30.40.10    1.1E-13     488      547    IPR013083      Zinc finger, RING/FYVE/PHD-type
PROSITE profile    PS50016             11.145      490      547    IPR019787      Zinc finger, PHD-finger
Pfam               PF00628             3.9E-11     492      547    IPR019787      Zinc finger, PHD-finger
SMART              SM00249             2.4E-10     492      545    IPR001965      Zinc finger, PHD-type
CDD                cd15504             2.66E-27    492      544    No hit         No description
PROSITE pattern    PS01359             0           493      544    IPR019786      Zinc finger, PHD-type, conserved site
Gene3D             G3DSA:1.10.10.60    7.1E-13     841      903    IPR009057      Homeodomain-like
SuperFamily        SSF46689            1.75E-11    850      905    IPR009057      Homeodomain-like
SMART              SM00389             6.6E-10     851      913    IPR001356      Homeobox domain
Pfam               PF00046             1.4E-9      862      903    IPR001356      Homeobox domain
CDD                cd00086             4.66E-11    863      903    No hit         No description
PROSITE profile    PS50071             12.115      866      909    IPR001356      Homeobox domain
Gene Ontology
GO Term       GO Category           GO Description
GO:0045893    Biological Process    positive regulation of transcription, DNA-templated
GO:0005634    Cellular Component    nucleus
GO:0005515    Molecular Function    protein binding
GO:0008270    Molecular Function    zinc ion binding
GO:0043565    Molecular Function    sequence-specific DNA binding
Sequence
Protein Sequence    Length: 951 aa
MIKVEHMGVS SSQAKSKKGN HFCPEESTSE QAHEFGSEYL LTELSENKNQ CGYAATQNES  60
AENATGVSSS GVHERSPEYV AKNSSPERSG LLPKGVMGHN HTDKSFYAQE TVSGKTHEYD  120
CEYVRTETSE EKHQPGSEIV QNELEEACSL VCDLPAKNLQ TFSEGLSENA ITESLGLLPE  180
DSSKHTKTDK LSCPQLVSSE PTVNFGSGNV CKELGESPEQ RQQLDSESLP NGIEESTIAV  240
SSNVSNQALQ LKPEDMGKSH CGGHLHSPPE GVTNVIQSSK SPLVEPLGLP QEFAQGNPST  300
QQSGLPCEDM AQNSGVEQHE TKPKNLLENS GRRRNGKTSK TIKKKYMLRS LRSSDRVLRS  360
KLQEKPKATE SSNNLADVGS SEQQKRRKRR RRKANREVAD EFSRIRTHLR YLLNRINYER  420
SLIAAYSTEG WKGLSLEKLK PEKELQRATS EILRRKLKIR DLFQHIDSLC AEGKLPESLF  480
DSEGQIDSED IFCAKCGSKD LSANNDIILC DGACDRGFHQ YCLQPPLLKE DIPPDDEGWL  540
CPGCDCKVDC IELVNESQGT SFSITDSWEK VFPEAAVAAA GQNQDPNFGL PSDDSDDNDY  600
NPDGSETDEK DHGDESSSEE SEFTSTSEEL EVPAKVDQYL GLPSDDSEDD DYDPDGPNHD  660
EVVKPESSSS DFSSDSEDLD AMLEEDITSQ KDEGPMANSA PRDSKRRKPK LGEKESMNDE  720
LLSIMEPASE QDGSAISKKR SIERLDYKRL YDETYGNVPS SSSDDEDWSD ITAPRKRNKC  780
TAEVASAPEN GNVSVSRTVS VSDGLKQNPE ETEHKPRRKT RQMSRFKDTD SSPAEIQGNT  840
SVSGSSGKKA GSSTYKRLGE AVKQRLYKSF KENQYPDRAT KQSLAKELDM TFQQVSKWFD  900
NARWSFNNSP SSHETIANNA SEKDITSSLP NKEVTGSGNV RDGDNSGKIN *
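The Start and End values in the Signature Domain table can be read as 1-based, inclusive positions in the protein sequence above. The following is a minimal Python sketch of that mapping, not part of the database record. It assumes each sequence row holds 60 residues in space-separated blocks of 10, and the names `ROWS`, `residues_by_position`, and `region` are hypothetical helpers introduced only for illustration. Only the four rows that cover the two domains are copied in.

```python
# Rows copied from the Protein Sequence block. Each key is the 1-based position
# of the first residue in that row (assumption: 60 residues per full row).
ROWS = {
    481: "DSEGQIDSED IFCAKCGSKD LSANNDIILC DGACDRGFHQ YCLQPPLLKE DIPPDDEGWL",
    541: "CPGCDCKVDC IELVNESQGT SFSITDSWEK VFPEAAVAAA GQNQDPNFGL PSDDSDDNDY",
    841: "SVSGSSGKKA GSSTYKRLGE AVKQRLYKSF KENQYPDRAT KQSLAKELDM TFQQVSKWFD",
    901: "NARWSFNNSP SSHETIANNA SEKDITSSLP NKEVTGSGNV RDGDNSGKIN",
}


def residues_by_position(rows):
    """Map each 1-based sequence position to its residue letter."""
    positions = {}
    for first, row in rows.items():
        # Drop the spaces that separate the blocks of 10 residues.
        for offset, residue in enumerate(row.replace(" ", "")):
            positions[first + offset] = residue
    return positions


def region(rows, start, end):
    """Return the residues from start to end (1-based, inclusive)."""
    positions = residues_by_position(rows)
    return "".join(positions[i] for i in range(start, end + 1))


# Coordinates taken from the Signature Domain table.
phd = region(ROWS, 492, 547)
homeobox = region(ROWS, 863, 903)

print("PHD     ", phd)
print("Homeobox", homeobox)
```

Ignoring letter case, the two extracted strings should match the Thecc1EG007171t2 rows of the domain alignments shown under Signature Domain.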
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      383      389    QKRRKRR
2      384      392    KRRKRRRRK
3      386      390    RKRRR
4      386      392    RKRRRRK
5      387      392    KRRRRK
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    AM464161    8e-40      AM464161.2 Vitis vinifera contig VV78X020750.14, whole genome shotgun sequence.
Annotation -- Protein
Source    Hit ID            E-value    Description
Refseq    XP_017971445.1    0.0        PREDICTED: homeobox protein HAT3.1
Refseq    XP_017971446.1    0.0        PREDICTED: homeobox protein HAT3.1
TrEMBL    A0A061E032        0.0        A0A061E032_THECC; Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain, putative isoform 1
STRING    EOX98399          0.0        (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT3G19510.1    1e-94      Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]