PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG029976t1
Common Name TCM_029976
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 906 aa    MW: 99318.6 Da    pI: 7.5706
Description B3 family protein
Gene Model
Gene Model ID       Type      Source    Coding Sequence
Thecc1EG029976t1    genome    CGD       View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1      B3        58       1.8e-18    355      454    1            98
                       EEEE-..-HHHHTT-EE--HHH.HTT..---..--SEEEEEETTS-EEEEEE....EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSE CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeeh..ggkkeesktltledesgrsWevkliy..rkksgryvltkGWkeFvkangLkegDfvvFkldgrse 89 
                       f+k l+ sd+++ grlvlpkk+ae++   ++++e+  l+++d++g++W +++++  +++s++yvl+ G     +  +L++gD+v+F++ +  +
  Thecc1EG029976t1 355 FEKMLSASDAGRIGRLVLPKKCAEAYfpPISQPEGLPLKVQDSKGKEWIFQFRFwpNNNSRMYVLE-GVTPCIQNMQLQAGDVVTFSRLE-PG 445
                       89**************************555566778***************9989999999***9.********************655.56 PP

                       E..EEEEE- CS
                B3  90 felvvkvfr 98 
                       ++lv++ ++
  Thecc1EG029976t1 446 GKLVMGCRK 454
                       656665554 PP
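
The Start and End columns of the Signature Domain table read as 1-based, inclusive residue positions, and the alignment rows above appear to mark gaps with "-" and insertions relative to the B3 model with lowercase letters. The following is a minimal sketch, under those assumptions, that slices residues 355-454 out of the protein and checks them against the ungapped alignment rows. The helper name slice_region and the first_pos parameter are hypothetical, introduced only for illustration.

```python
def slice_region(fragment: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at position first_pos of the full protein."""
    if not first_pos <= start <= end <= first_pos + len(fragment) - 1:
        raise ValueError("region falls outside the fragment")
    # Convert 1-based inclusive coordinates to a 0-based half-open slice.
    return fragment[start - first_pos : end - first_pos + 1]


# Residues 301-480 of Thecc1EG029976t1, copied from the Sequence section
# with the block spacing and position counters removed.
FRAGMENT = (
    "GRPRPDGRGRNQLFPRYWPRFTDQDLQQISGEYPLILWGILDNWECSNSVITPLFEKMLS"
    "ASDAGRIGRLVLPKKCAEAYFPPISQPEGLPLKVQDSKGKEWIFQFRFWPNNNSRMYVLE"
    "GVTPCIQNMQLQAGDVVTFSRLEPGGKLVMGCRKASTASASEQDNEAKNNSSASPSFSSI"
)

# Query rows of the B3 alignment (the lines labelled Thecc1EG029976t1).
ALIGNED_ROWS = [
    "FEKMLSASDAGRIGRLVLPKKCAEAYfpPISQPEGLPLKVQDSKGKEWIFQFRFwpNNNSRMYVLE-GVTPCIQNMQLQAGDVVTFSRLE-PG",
    "GKLVMGCRK",
]

b3_region = slice_region(FRAGMENT, 355, 454, first_pos=301)

# Assumption: "-" is a gap and lowercase marks an inserted residue,
# so dropping gaps and upper-casing recovers the plain sequence.
ungapped = "".join(ALIGNED_ROWS).replace("-", "").upper()

print(len(b3_region), b3_region == ungapped)
```

Passing the offset of the fragment explicitly keeps the table coordinates usable as printed, without first loading the full 906-residue sequence.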

Protein Features
Database           Entry ID             E-value     Start    End    InterPro ID    Description
SuperFamily        SSF101936            2.09E-19    352      451    IPR015300      DNA-binding pseudobarrel domain
Gene3D             G3DSA:2.40.330.10    5.6E-30     352      460    IPR015300      DNA-binding pseudobarrel domain
CDD                cd10017              1.08E-21    353      454    No hit         No description
PROSITE profile    PS50863              10.87       355      456    IPR003340      B3 DNA binding domain
SMART              SM01019              6.7E-20     355      455    IPR003340      B3 DNA binding domain
Pfam               PF02362              1.1E-15     355      454    IPR003340      B3 DNA binding domain
PROSITE profile    PS51050              14.012      586      636    IPR011124      Zinc finger, CW-type
Pfam               PF07496              9.4E-12     591      633    IPR011124      Zinc finger, CW-type
Gene Ontology
GO Term       GO Category           GO Description
GO:0006355    Biological Process    regulation of transcription, DNA-templated
GO:0048366    Biological Process    leaf development
GO:0005634    Cellular Component    nucleus
GO:0003677    Molecular Function    DNA binding
GO:0008270    Molecular Function    zinc ion binding
Sequence
Protein Sequence    Length: 906 aa
MTSTSGAASS SKICFNSDCK DLKSERARKG WRLRTGELAE LCDRCASAFE EGRFCDTFHL  60
NASGWRSCES CGKRVHCGCI VSVYAFTLLD AGGIECIACA RKNVVLGSNS SWPPSLLFHP  120
PLSERLKDYS AKGWSQLAGS GPVPWRQAPS LFNSPISQPE WHSKVCYEVD LSTGIDRLNA  180
DRLSTPSLEK KKIEDFSERL MNGTLKLGTR DIHENGNAGI NCEEQPGSCL TKSQQPSLKE  240
EPSNPPLGLS VPYTSPDEAN GQIGVSGPHL RPNPPPPLAK QFHSNLHNGL DSSGETQIRN  300
GRPRPDGRGR NQLFPRYWPR FTDQDLQQIS GEYPLILWGI LDNWECSNSV ITPLFEKMLS  360
ASDAGRIGRL VLPKKCAEAY FPPISQPEGL PLKVQDSKGK EWIFQFRFWP NNNSRMYVLE  420
GVTPCIQNMQ LQAGDVVTFS RLEPGGKLVM GCRKASTASA SEQDNEAKNN SSASPSFSSI  480
NQAELADPTS WSKVDKSGYI AKEALGTKLA VSRKRKNSTL GSKSKRLRID NEDLIELKLT  540
WEEAQGLLRP PPNHVPSVVV IEGFEFEEYE DAPILGKPTI FATDNSGEKI QWAQCEDCFK  600
WRRLPSSALL PSKWTCASNS WDPERSFCSV AQELTAEQLD DLLPHCNPAA SKKMKAAKQE  660
PENVDALEGL DTLANLAILG EGEALPASSQ ATTKHPRHRP GCSCIVCIQP PSGKGPKHKQ  720
TCTCNVCQTV KRRFRTLMLR REKKQSQKEA ETTRKKQQPS LPDKLLDDDP LPSTNAGNSS  780
PNPKKLVSEG SDDDPNRIKS STSPFKGQID LNIQPEREEE LSPGSDSGSM MRLLQDATER  840
YLRQQRMLSS GVNSDSTVTQ AQSGGGTEGE KTSSSVNLGA SHQDADRDHS AVFSIKSSAP  900
TSATG*
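
The listing above prints the protein in blocks of ten residues with a running position counter at the end of each line and a trailing "*" as terminator. The following is a minimal sketch, assuming exactly that layout, of how such a listing could be collapsed into a plain residue string. The function name parse_blocked_sequence is hypothetical, and only the first two lines of the record are used as sample input.

```python
def parse_blocked_sequence(text: str) -> str:
    """Collapse a blocked protein listing into one residue string.

    Assumes whitespace-separated residue blocks, purely numeric
    position counters, and an optional trailing '*' terminator.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # running position counter, not sequence
            residues.append(token.rstrip("*"))
    return "".join(residues)


# First two lines of the Thecc1EG029976t1 listing (residues 1-120).
SAMPLE = """\
MTSTSGAASS SKICFNSDCK DLKSERARKG WRLRTGELAE LCDRCASAFE EGRFCDTFHL  60
NASGWRSCES CGKRVHCGCI VSVYAFTLLD AGGIECIACA RKNVVLGSNS SWPPSLLFHP  120
"""

sequence = parse_blocked_sequence(SAMPLE)
print(len(sequence))
```

Run over the full listing, the length of the result can be compared with the 906 aa stated in the record as a quick sanity check on the copy.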
3D Structure
PDB ID    E-value    Query Start    Query End    Hit Start    Hit End    Description
6j9a_A    8e-62      319            458          2            127        B3 domain-containing transcription repressor VAL1
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Protein
Source       Hit ID            E-value    Description
Refseq       XP_007025776.2    0.0        PREDICTED: B3 domain-containing protein Os07g0563300
Swissprot    Q0D5G4            0.0        Y7633_ORYSJ; B3 domain-containing protein Os07g0563300
TrEMBL       A0A061GGV7        0.0        A0A061GGV7_THECC; Transcription factor, putative isoform 1
STRING       EOY28397          0.0        (Theobroma cacao)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Malvids    OGEM6857    25    36
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT4G32010.1    1e-157     HSI2-like 1
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]