PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG033936t3
Common Name TCM_033936
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 349 aa    MW: 39405.1 Da    pI: 9.3499
Description B3 family protein
Gene Model
Gene Model ID | Type | Source | Coding Sequence
Thecc1EG033936t3 | genome | CGD | View CDS
Signature Domain
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | B3 | 50 | 5.6e-16 | 6 | 79 | 18 | 97
                      --HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE CS
                B3 18 lpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvf 97
                      +p kfa+++++k   + t+tl+ +sg  W+v l  +  +++ ++  GWk Fvk++ L e+D ++Fk++g s+f   v +f
  Thecc1EG033936t3  6 VPEKFAKNMKQK--LPETVTLKGPSGIIWDVGL--KADGDTLFFDCGWKIFVKDHSLVENDLLIFKYNGMSQF--DVLMF 79
                      799***666544..888****************..999999**************************998888..66665 PP

2 | B3 | 49.7 | 6.8e-16 | 236 | 317 | 1 | 85
                       EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE- CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkld 85 
                       f  v++p +v ++  + +p   +++h +k++e  +++    + ++W+ ++ y +++g   l++GW++Fv++n+L+e D +vF+  
  Thecc1EG033936t3 236 FMVVMKPTHVARRFYMAIPTAWVAKHLSKQNE--DVI-LRINKQTWKTRFYYHRNRGCGGLSGGWRNFVNDNNLDEDDACVFEPA 317
                       77899*********************666444..455.456889******88666666679**********************43 PP

Protein Features
Database | Entry ID | E-value | Start | End | InterPro ID | Description
PROSITE profile | PS50863 | 14.621 | 1 | 82 | IPR003340 | B3 DNA binding domain
SMART | SM01019 | 1.0E-4 | 1 | 82 | IPR003340 | B3 DNA binding domain
SuperFamily | SSF101936 | 8.44E-17 | 5 | 80 | IPR015300 | DNA-binding pseudobarrel domain
Gene3D | G3DSA:2.40.330.10 | 2.7E-18 | 5 | 80 | IPR015300 | DNA-binding pseudobarrel domain
CDD | cd10017 | 7.20E-17 | 6 | 80 | No hit | No description
Pfam | PF02362 | 4.1E-13 | 6 | 76 | IPR003340 | B3 DNA binding domain
Gene3D | G3DSA:2.40.330.10 | 2.4E-19 | 235 | 332 | IPR015300 | DNA-binding pseudobarrel domain
CDD | cd10017 | 5.09E-20 | 236 | 332 | No hit | No description
SuperFamily | SSF101936 | 4.71E-18 | 236 | 325 | IPR015300 | DNA-binding pseudobarrel domain
Pfam | PF02362 | 6.9E-14 | 236 | 318 | IPR003340 | B3 DNA binding domain
SMART | SM01019 | 4.8E-12 | 236 | 329 | IPR003340 | B3 DNA binding domain
PROSITE profile | PS50863 | 12.21 | 236 | 334 | IPR003340 | B3 DNA binding domain
Gene Ontology
GO Term | GO Category | GO Description
GO:0006355 | Biological Process | regulation of transcription, DNA-templated
GO:0005634 | Cellular Component | nucleus
GO:0005773 | Cellular Component | vacuole
GO:0003677 | Molecular Function | DNA binding
Sequence
Protein Sequence    Length: 349 aa
MKLQEVPEKF AKNMKQKLPE TVTLKGPSGI IWDVGLKADG DTLFFDCGWK IFVKDHSLVE  60
NDLLIFKYNG MSQFDVLMFD GRSLCEKAAS YFVRKCGHTE YDSGCQTKRK MNETPVEIVH  120
NSSHCGLESS PEKSINNNID TRPSRQPITS AATNKKLRIV GSSTRSIPAR KSLRGKDLTT  180
FAAEVKVETG DLEFDHTSMD GDVFSPRHTA RKRRATQVEK ANVFLMAQEA LPREGFMVVM  240
KPTHVARRFY MAIPTAWVAK HLSKQNEDVI LRINKQTWKT RFYYHRNRGC GGLSGGWRNF  300
VNDNNLDEDD ACVFEPADIG NKPMILDVSI FRVLQAPVPL IQVHPASY*
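
The sequence block above carries position counters and a trailing `*`, so it needs cleaning before residue coordinates can be applied to it. The following is a minimal sketch, not part of the database record, that strips those extras and slices out the two B3 regions using the Start/End coordinates from the Signature Domain section (6-79 and 236-317). The names `clean_sequence` and `domain` are hypothetical helpers introduced for illustration, and the coordinates are assumed to be 1-based and inclusive, which is consistent with the alignment lines shown earlier.

```python
import re

# Sequence block copied from this record, including the position
# counters at the end of each line and the terminal "*".
RAW = """
MKLQEVPEKF AKNMKQKLPE TVTLKGPSGI IWDVGLKADG DTLFFDCGWK IFVKDHSLVE  60
NDLLIFKYNG MSQFDVLMFD GRSLCEKAAS YFVRKCGHTE YDSGCQTKRK MNETPVEIVH  120
NSSHCGLESS PEKSINNNID TRPSRQPITS AATNKKLRIV GSSTRSIPAR KSLRGKDLTT  180
FAAEVKVETG DLEFDHTSMD GDVFSPRHTA RKRRATQVEK ANVFLMAQEA LPREGFMVVM  240
KPTHVARRFY MAIPTAWVAK HLSKQNEDVI LRINKQTWKT RFYYHRNRGC GGLSGGWRNF  300
VNDNNLDEDD ACVFEPADIG NKPMILDVSI FRVLQAPVPL IQVHPASY*
"""


def clean_sequence(raw: str) -> str:
    """Keep only letters: drops whitespace, position counters, and '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


protein = clean_sequence(RAW)

# Coordinates taken from the Signature Domain section of this record.
b3_first = domain(protein, 6, 79)
b3_second = domain(protein, 236, 317)

print(len(protein))      # 348 residue letters once "*" is removed
print(b3_first[:12])     # VPEKFAKNMKQK
print(b3_second[:10])    # FMVVMKPTHV
```

The cleaned sequence has 348 residue letters, so the stated length of 349 appears to count the terminal `*` as well; that reading is an inference from the block itself rather than something the record states.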
Binding Motif
Motif ID | Method | Source
MP00468 | DAP | Transfer from AT4G33280
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_007017398.2 | 0.0 | PREDICTED: B3 domain-containing protein REM16
TrEMBL | A0A061FCJ2 | 0.0 | A0A061FCJ2_THECC; AP2/B3-like transcriptional factor family protein, putative isoform 3
STRING | EOY14621 | 0.0 | (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT4G33280.1 | 2e-71 | B3 family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]