PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG033936t2
Common NameTCM_033936
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 382aa    MW: 43448.6 Da    PI: 8.4813
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG033936t2genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B353.54.3e-17331121297
                       HTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE CS
                B3  12 ksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvf 97 
                        ++rl +p kfa+++++k   + t+tl+ +sg  W+v l  +  +++ ++  GWk Fvk++ L e+D ++Fk++g s+f   v +f
  Thecc1EG033936t2  33 FLQRLEVPEKFAKNMKQK--LPETVTLKGPSGIIWDVGL--KADGDTLFFDCGWKIFVKDHSLVENDLLIFKYNGMSQF--DVLMF 112
                       567899******666544..888****************..999999**************************998888..66665 PP

2B349.48e-16269350185
                       EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE- CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkld 85 
                       f  v++p +v ++  + +p   +++h +k++e  +++    + ++W+ ++ y +++g   l++GW++Fv++n+L+e D +vF+  
  Thecc1EG033936t2 269 FMVVMKPTHVARRFYMAIPTAWVAKHLSKQNE--DVI-LRINKQTWKTRFYYHRNRGCGGLSGGWRNFVNDNNLDEDDACVFEPA 350
                       77899*********************666444..455.456889******88666666679**********************43 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019362.75E-1921113IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086313.8622115IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.6E-2025113IPR015300DNA-binding pseudobarrel domain
SMARTSM010198.8E-1225115IPR003340B3 DNA binding domain
CDDcd100173.02E-1830113No hitNo description
PfamPF023624.3E-1434109IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.102.8E-19268365IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086312.21269367IPR003340B3 DNA binding domain
PfamPF023628.2E-14269351IPR003340B3 DNA binding domain
SMARTSM010194.8E-12269362IPR003340B3 DNA binding domain
CDDcd100172.22E-19269365No hitNo description
SuperFamilySSF1019365.69E-18269358IPR015300DNA-binding pseudobarrel domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0005773Cellular Componentvacuole
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 382 aa     Download sequence    Send to blast
MGDSCKDCRS WEEEIFWTHF QSIHFSQFLH GDFLQRLEVP EKFAKNMKQK LPETVTLKGP  60
SGIIWDVGLK ADGDTLFFDC GWKIFVKDHS LVENDLLIFK YNGMSQFDVL MFDGRSLCEK  120
AASYFVRKCG HTEYDSGCQT KRKMNETPVE IVHNSSHCGL ESSPEKSINN NIDTRPSRQP  180
ITSAATNKKL RIVGSSTRSI PARKSLRGKD LTTFAAEVKV ETGDLEFDHT SMDGDVFSPR  240
HTARKRRATQ VEKANVFLMA QEALPREGFM VVMKPTHVAR RFYMAIPTAW VAKHLSKQNE  300
DVILRINKQT WKTRFYYHRN RGCGGLSGGW RNFVNDNNLD EDDACVFEPA DIGNKPMILD  360
VSIFRVLQAP VPLIQVHPAS Y*
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00468DAPTransfer from AT4G33280Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007017398.20.0PREDICTED: B3 domain-containing protein REM16
TrEMBLA0A061FC700.0A0A061FC70_THECC; AP2/B3-like transcriptional factor family protein, putative isoform 1
STRINGEOY146210.0(Theobroma cacao)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G33280.17e-90B3 family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]