PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG036843t1
Common NameTCM_036843
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 260aa    MW: 29710.3 Da    PI: 10.2648
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG036843t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B351.22.3e-1612901299
                      HTT-EE--HHH.HTT.---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
                B3 12 ksgrlvlpkkfaeeh.ggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99
                       +++l++p +f+ ++ +g+ ++    +l++  +r+W+vkl  ++++    +++GW+eF+++++L +gD++vF++ g++ f   v+vf++
  Thecc1EG036843t1 12 FHKQLSIPLSFFIKYlKGQ-NC-ERAVLRSCGSRTWSVKL--KGRR----FEDGWEEFARDHDLYVGDVLVFRHGGNMVF--DVMVFDT 90
                      4668***********7444.34.479**************..7777....9**********************9999999..8888875 PP

2B364.71.4e-20137226495
                       E-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEE CS
                B3   4 vltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvk 95 
                       +++++++lk + l +p+kfa+++g+  +   +++l d++grsW  +l+++k++g++++++GW++ + an+Lke D+v+++l+g+ ++++ +k
  Thecc1EG036843t1 137 ATLKPNNLKVSKLNIPRKFARSNGLTDRF-CEMVLVDQQGRSWIANLRHKKSDGQVYIGRGWRNLCIANNLKEEDSVLLELIGN-GKKPIFK 226
                       444777888888***********999765.5***************************************************95.4445555 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5086312.464191IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.5E-20289IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.22E-19291IPR015300DNA-binding pseudobarrel domain
CDDcd100174.07E-21289No hitNo description
SMARTSM010195.5E-20491IPR003340B3 DNA binding domain
PfamPF023621.2E-121288IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.102.3E-24127225IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.96E-22130227IPR015300DNA-binding pseudobarrel domain
CDDcd100175.81E-21133227No hitNo description
PROSITE profilePS5086313.578134233IPR003340B3 DNA binding domain
SMARTSM010192.5E-22135231IPR003340B3 DNA binding domain
PfamPF023621.6E-19136226IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009793Biological Processembryo development ending in seed dormancy
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 260 aa     Download sequence    Send to blast
MPHFLKPLLP GFHKQLSIPL SFFIKYLKGQ NCERAVLRSC GSRTWSVKLK GRRFEDGWEE  60
FARDHDLYVG DVLVFRHGGN MVFDVMVFDT RSACQREYPL FAMKGKDQKK SSAKRFGKQL  120
EKCTSTSFKH EHPYFVATLK PNNLKVSKLN IPRKFARSNG LTDRFCEMVL VDQQGRSWIA  180
NLRHKKSDGQ VYIGRGWRNL CIANNLKEED SVLLELIGNG KKPIFKLEVA RDSSAKTKPN  240
HPDSKAGDWS CFCEGSSRC*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007011630.20.0PREDICTED: putative B3 domain-containing protein REM15 isoform X1
TrEMBLA0A061GQ630.0A0A061GQ63_THECC; DNA binding protein, putative isoform 1
STRINGEOY292490.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM20422261
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00260.17e-49B3 family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]