PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG000520t1
Common Name TCM_000520
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 279aa    MW: 30518.3 Da    PI: 11.2421
Description B3 family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG000520t1  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    B3      66.9   2.9e-21  17     102  2          94
                       EEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EE CS
                B3   2 fkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvv 94 
                       fkvl     + +  l++p  f+++++g+   ++  tl++ sg sW+v++  +++ g+y++ +GW++Fv+++gL++gDfvvF+l+g+s+f +v+
  Thecc1EG000520t1  17 FKVL--IGDFVN-KLRIPPAFVKNFQGNV--PTNFTLKSNSGSSWRVTV--QNTEGSYFFCGGWSNFVEDQGLDSGDFVVFYLVGKSSFDCVI 102
                       5566..333333.489******9997774..4569999***********..*********************************999997765 PP

2    B3      51.5   1.8e-16  188    269  13         98
                       TT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE- CS
                B3  13 sgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfr 98 
                       +  +v+p +fa+e g+ +++s   +++d++gr W + +  + +s+++ l++GW +F  +n+L +gD+++F++++ ++   +v+++ 
  Thecc1EG000520t1 188 KFSVVVPSSFAREAGLAEKRS--TVIKDPKGRMWPLGI--SVGSRQVRLSAGWTKFRLENRLVAGDTLLFQYIRGTGNAIHVQIVG 269
                       45689*********9997775..8889***********..*********************************9888889999876 PP
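
The Start and End values for the two B3 hits appear to be 1-based, inclusive positions in the protein sequence. The following is a minimal Python sketch under that assumption: it slices both regions out of the sequence given in the Sequence section (position counters and the trailing `*` removed) and compares each slice with the corresponding alignment row after its gap characters are stripped. `domain_region` is a hypothetical helper name used only for illustration; it is not part of PlantTFDB or HMMER.

```python
# Protein sequence from the Sequence section of this record,
# with spaces, position counters, and the trailing "*" removed.
PROTEIN = (
    "MAALKMARKPKQNPSFFKVLIGDFVNKLRIPPAFVKNFQGNVPTNFTLKSNSGSSWRVTV"
    "QNTEGSYFFCGGWSNFVEDQGLDSGDFVVFYLVGKSSFDCVIYGPTGCGKKIVLKTKRKR"
    "GRPKKSNEVTPSEAGASSFQKATRVSPGCRITRAPARRVINVGQQIVVVSEAISKHPSFT"
    "VVLKKYQKFSVVVPSSFAREAGLAEKRSTVIKDPKGRMWPLGISVGSRQVRLSAGWTKFR"
    "LENRLVAGDTLLFQYIRGTGNAIHVQIVGKAGYGNSGR"
)

# Target rows copied from the two alignment blocks ("-" marks a gap).
ALIGNED_1 = ("FKVL--IGDFVN-KLRIPPAFVKNFQGNV--PTNFTLKSNSGSSWRVTV--"
             "QNTEGSYFFCGGWSNFVEDQGLDSGDFVVFYLVGKSSFDCVI")
ALIGNED_2 = ("KFSVVVPSSFAREAGLAEKRS--TVIKDPKGRMWPLGI--"
             "SVGSRQVRLSAGWTKFRLENRLVAGDTLLFQYIRGTGNAIHVQIVG")


def domain_region(sequence, start, end):
    """Return residues start..end, both 1-based and inclusive (assumed convention)."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


b3_first = domain_region(PROTEIN, 17, 102)
b3_second = domain_region(PROTEIN, 188, 269)

# Each slice should equal its alignment row once gaps are removed.
first_matches = b3_first == ALIGNED_1.replace("-", "")
second_matches = b3_second == ALIGNED_2.replace("-", "")
print(len(b3_first), first_matches)
print(len(b3_second), second_matches)
```

If either comparison came out false, the coordinate convention assumed here would need to be revisited before the slices are used elsewhere.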

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:2.40.330.10  1.2E-24   10     108  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          5.89E-27  11     110  IPR015300    DNA-binding pseudobarrel domain
PROSITE profile  PS50863            13.789    13     106  IPR003340    B3 DNA binding domain
CDD              cd10017            3.50E-19  14     97   No hit       No description
SMART            SM01019            6.5E-13   16     106  IPR003340    B3 DNA binding domain
Pfam             PF02362            1.5E-18   16     102  IPR003340    B3 DNA binding domain
SuperFamily      SSF101936          1.04E-17  174    269  IPR015300    DNA-binding pseudobarrel domain
Gene3D           G3DSA:2.40.330.10  1.7E-19   174    270  IPR015300    DNA-binding pseudobarrel domain
PROSITE profile  PS50863            13.239    176    271  IPR003340    B3 DNA binding domain
CDD              cd10017            1.39E-14  177    267  No hit       No description
SMART            SM01019            1.5E-9    179    271  IPR003340    B3 DNA binding domain
Pfam             PF02362            7.4E-14   187    268  IPR003340    B3 DNA binding domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 279 aa
MAALKMARKP KQNPSFFKVL IGDFVNKLRI PPAFVKNFQG NVPTNFTLKS NSGSSWRVTV  60
QNTEGSYFFC GGWSNFVEDQ GLDSGDFVVF YLVGKSSFDC VIYGPTGCGK KIVLKTKRKR  120
GRPKKSNEVT PSEAGASSFQ KATRVSPGCR ITRAPARRVI NVGQQIVVVS EAISKHPSFT  180
VVLKKYQKFS VVVPSSFARE AGLAEKRSTV IKDPKGRMWP LGISVGSRQV RLSAGWTKFR  240
LENRLVAGDT LLFQYIRGTG NAIHVQIVGK AGYGNSGR*
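
The sequence block above is laid out for reading: groups of ten residues, a running position count at the end of each full line, and a final `*`. The Python sketch below shows one way to turn it into a plain residue string and a FASTA record. The function names `clean_sequence_block` and `to_fasta` are hypothetical and chosen only for this example. Without the `*` the block yields 278 residues, which suggests that the stated length of 279 counts the stop symbol; that reading is an inference from the text shown here, not documented behavior.

```python
import re

# Sequence block as displayed in this record.
RAW_BLOCK = """
MAALKMARKP KQNPSFFKVL IGDFVNKLRI PPAFVKNFQG NVPTNFTLKS NSGSSWRVTV  60
QNTEGSYFFC GGWSNFVEDQ GLDSGDFVVF YLVGKSSFDC VIYGPTGCGK KIVLKTKRKR  120
GRPKKSNEVT PSEAGASSFQ KATRVSPGCR ITRAPARRVI NVGQQIVVVS EAISKHPSFT  180
VVLKKYQKFS VVVPSSFARE AGLAEKRSTV IKDPKGRMWP LGISVGSRQV RLSAGWTKFR  240
LENRLVAGDT LLFQYIRGTG NAIHVQIVGK AGYGNSGR*
"""


def clean_sequence_block(block):
    """Remove whitespace, position counters, and a trailing stop symbol."""
    letters = re.sub(r"[\s\d]", "", block)
    return letters.rstrip("*")


def to_fasta(identifier, sequence, width=60):
    """Format a sequence as a FASTA record with fixed-width lines."""
    lines = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + identifier + "\n" + "\n".join(lines) + "\n"


protein = clean_sequence_block(RAW_BLOCK)
fasta = to_fasta("Thecc1EG000520t1", protein)
print(len(protein))
print(fasta)
```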
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    109    119  KKIVLKTKRKR
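
To relate the NLS coordinates to the protein sequence, the motif can be located by a plain substring search. The Python sketch below assumes the sequence from the Sequence section with position counters and the trailing `*` removed. In that sequence the motif begins at zero-based offset 109, so the listed Start and End (109, 119) look like zero-based positions, whereas the Signature Domain coordinates match 1-based positions. This is an observation from the data shown here, not a documented convention.

```python
# Protein sequence from the Sequence section of this record,
# with spaces, position counters, and the trailing "*" removed.
PROTEIN = (
    "MAALKMARKPKQNPSFFKVLIGDFVNKLRIPPAFVKNFQGNVPTNFTLKSNSGSSWRVTV"
    "QNTEGSYFFCGGWSNFVEDQGLDSGDFVVFYLVGKSSFDCVIYGPTGCGKKIVLKTKRKR"
    "GRPKKSNEVTPSEAGASSFQKATRVSPGCRITRAPARRVINVGQQIVVVSEAISKHPSFT"
    "VVLKKYQKFSVVVPSSFAREAGLAEKRSTVIKDPKGRMWPLGISVGSRQVRLSAGWTKFR"
    "LENRLVAGDTLLFQYIRGTGNAIHVQIVGKAGYGNSGR"
)
NLS = "KKIVLKTKRKR"

# str.find returns the zero-based offset of the first match, or -1 if absent.
offset = PROTEIN.find(NLS)
if offset == -1:
    raise ValueError("NLS motif not found in the sequence")

zero_based = (offset, offset + len(NLS) - 1)  # inclusive, zero-based
one_based = (offset + 1, offset + len(NLS))   # inclusive, 1-based

print("zero-based:", zero_based)
print("one-based:", one_based)
```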
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007047117.2  0.0      PREDICTED: B3 domain-containing protein Os01g0723500
TrEMBL  A0A061DG19      0.0      A0A061DG19_THECC; Uncharacterized protein
STRING  EOX91274        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM14995620
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G66980.1  8e-21    B3 family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]