PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG033936t1
Common Name TCM_033936
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 410aa    MW: 46729.4 Da    PI: 9.091
Description B3 family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG033936t1  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    B3      53.4   4.8e-17  61     140  12         97
                       HTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE CS
                B3  12 ksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvf 97 
                        ++rl +p kfa+++++k   + t+tl+ +sg  W+v l  +  +++ ++  GWk Fvk++ L e+D ++Fk++g s+f   v +f
  Thecc1EG033936t1  61 FLQRLEVPEKFAKNMKQK--LPETVTLKGPSGIIWDVGL--KADGDTLFFDCGWKIFVKDHSLVENDLLIFKYNGMSQF--DVLMF 140
                       567899******666544..888****************..999999**************************998888..66665 PP

2    B3      49.3   9.1e-16  297    378  1          85
                       EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE- CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkld 85 
                       f  v++p +v ++  + +p   +++h +k++e  +++    + ++W+ ++ y +++g   l++GW++Fv++n+L+e D +vF+  
  Thecc1EG033936t1 297 FMVVMKPTHVARRFYMAIPTAWVAKHLSKQNE--DVI-LRINKQTWKTRFYYHRNRGCGGLSGGWRNFVNDNNLDEDDACVFEPA 378
                       77899*********************666444..455.456889******88666666679**********************43 PP

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End  InterPro ID  Description
SuperFamily      SSF101936          3.14E-19  49     141  IPR015300    DNA-binding pseudobarrel domain
PROSITE profile  PS50863            13.86     50     143  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  1.9E-20   53     141  IPR015300    DNA-binding pseudobarrel domain
SMART            SM01019            8.8E-12   53     143  IPR003340    B3 DNA binding domain
CDD              cd10017            5.76E-18  58     141  No hit       No description
Pfam             PF02362            4.8E-14   62     137  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  3.2E-19   296    393  IPR015300    DNA-binding pseudobarrel domain
PROSITE profile  PS50863            12.21     297    395  IPR003340    B3 DNA binding domain
Pfam             PF02362            9.3E-14   297    379  IPR003340    B3 DNA binding domain
SuperFamily      SSF101936          6.28E-18  297    386  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            4.35E-19  297    393  No hit       No description
SMART            SM01019            4.8E-12   297    390  IPR003340    B3 DNA binding domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0005773  Cellular Component  vacuole
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 410 aa
MPFTETRSKS KAFISYRTMS RSAVWVCKMG DSCKDCRSWE EEIFWTHFQS IHFSQFLHGD  60
FLQRLEVPEK FAKNMKQKLP ETVTLKGPSG IIWDVGLKAD GDTLFFDCGW KIFVKDHSLV  120
ENDLLIFKYN GMSQFDVLMF DGRSLCEKAA SYFVRKCGHT EYDSGCQTKR KMNETPVEIV  180
HNSSHCGLES SPEKSINNNI DTRPSRQPIT SAATNKKLRI VGSSTRSIPA RKSLRGKDLT  240
TFAAEVKVET GDLEFDHTSM DGDVFSPRHT ARKRRATQVE KANVFLMAQE ALPREGFMVV  300
MKPTHVARRF YMAIPTAWVA KHLSKQNEDV ILRINKQTWK TRFYYHRNRG CGGLSGGWRN  360
FVNDNNLDED DACVFEPADI GNKPMILDVS IFRVLQAPVP LIQVHPASY*
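The sequence listing above is grouped in blocks of ten residues with running position counts, so it needs light cleaning before reuse. The following is a minimal sketch, not part of the PlantTFDB record, of how the listing might be turned into a plain string and how the two B3 domain spans reported under Signature Domain (61-140 and 297-378) could be sliced out. The helper names `clean_sequence` and `region` are hypothetical. The sketch also assumes that the trailing `*` is a stop marker and that the reported length of 410 counts it, which is what the residue count suggests.

```python
# Sequence block copied from the record; spaces and position counts are layout only.
RAW = """
MPFTETRSKS KAFISYRTMS RSAVWVCKMG DSCKDCRSWE EEIFWTHFQS IHFSQFLHGD  60
FLQRLEVPEK FAKNMKQKLP ETVTLKGPSG IIWDVGLKAD GDTLFFDCGW KIFVKDHSLV  120
ENDLLIFKYN GMSQFDVLMF DGRSLCEKAA SYFVRKCGHT EYDSGCQTKR KMNETPVEIV  180
HNSSHCGLES SPEKSINNNI DTRPSRQPIT SAATNKKLRI VGSSTRSIPA RKSLRGKDLT  240
TFAAEVKVET GDLEFDHTSM DGDVFSPRHT ARKRRATQVE KANVFLMAQE ALPREGFMVV  300
MKPTHVARRF YMAIPTAWVA KHLSKQNEDV ILRINKQTWK TRFYYHRNRG CGGLSGGWRN  360
FVNDNNLDED DACVFEPADI GNKPMILDVS IFRVLQAPVP LIQVHPASY*
"""


def clean_sequence(raw: str) -> str:
    """Keep residue letters and the '*' marker; drop spaces, digits, newlines."""
    return "".join(ch for ch in raw if ch.isalpha() or ch == "*")


def region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return sequence[start - 1:end]


seq = clean_sequence(RAW)      # includes the trailing '*'
protein = seq.rstrip("*")      # residues only (assumes '*' marks the stop)

# Domain spans taken from the Signature Domain table of this record.
b3_first = region(protein, 61, 140)
b3_second = region(protein, 297, 378)

print(len(seq), len(protein))
print(b3_first)
print(b3_second)
```

The sliced spans can be compared against the ungapped query rows of the two domain alignments as a sanity check on the coordinates.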
Binding Motif
Motif ID  Method  Source                   Motif file
MP00468   DAP     Transfer from AT4G33280  Download
Motif logo
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007017398.2  0.0      PREDICTED: B3 domain-containing protein REM16
TrEMBL  A0A061FC70      0.0      A0A061FC70_THECC; AP2/B3-like transcriptional factor family protein, putative isoform 1
STRING  EOY14621        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM104282335
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G33280.1  2e-89    B3 family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]