PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG007802t1
Common Name TCM_007802
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family bHLH
Protein Properties Length: 345aa    MW: 39339.9 Da    PI: 8.0137
Description bHLH family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG007802t1  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    HLH     26.4   1.2e-08  213    258  5          55
                       HHHHHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHHH CS
               HLH   5 hnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksLq 55 
                       h+e  r+RR+++ ++   L++l+P++     kK++  ++L++A +YI+ L+
  Thecc1EG007802t1 213 HSELARKRRQKLSDKTRCLQKLMPWD-----KKMDTGTMLQEAYKYIRFLE 258
                       89999********************9.....9****************995 PP

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50888             13.611    208    257  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
SuperFamily      SSF47459            4.84E-12  208    268  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
CDD              cd00083             8.75E-9   212    262  No hit       No description
Pfam             PF00010             3.3E-6    213    258  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
Gene3D           G3DSA:4.10.280.10   1.3E-11   213    265  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
SMART            SM00353             3.6E-8    214    263  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
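The six predictions above place the bHLH domain at slightly different coordinates. As a minimal sketch (not part of the PlantTFDB record), the following Python compares them by computing the widest span covered by any hit and the core region shared by all hits. The `hits` list and both function names are illustrative assumptions; the coordinates are copied from the table.

```python
# Start/End coordinates copied from the Protein Features table above.
# The list layout and function names are illustrative, not a PlantTFDB API.
hits = [
    ("PROSITE profile", 208, 257),
    ("SuperFamily", 208, 268),
    ("CDD", 212, 262),
    ("Pfam", 213, 258),
    ("Gene3D", 213, 265),
    ("SMART", 214, 263),
]


def widest_span(records):
    """Smallest start and largest end over all hits (1-based, inclusive)."""
    return min(r[1] for r in records), max(r[2] for r in records)


def shared_core(records):
    """Region covered by every hit, or None if the hits do not all overlap."""
    start = max(r[1] for r in records)
    end = min(r[2] for r in records)
    return (start, end) if start <= end else None


span = widest_span(hits)
core = shared_core(hits)
print("widest span:", span)
print("shared core:", core)
```

The shared core is a conservative choice when a single set of domain boundaries is needed; the widest span is the more inclusive alternative.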
Gene Ontology
GO Term     GO Category         GO Description
GO:0046983  Molecular Function  protein dimerization activity
Sequence
Protein Sequence    Length: 345 aa
MCVSLKILLY SRFHFCYTEH RCYVAFCTFP LSSLNFHCYH FSVFDINSMK MERGFGSTHP  60
CYPPAFTAEF SINSSYEANF NSIFNDPNLQ PLLPLPSDPS TFNFFSQDFP SLPLEPQLPV  120
PDLDSLYSSS LPTKIPDILP DSTQFLDFFN KPLPDLHSLE QPHRQPHFTE PSVSSSTRKL  180
KRTRLDLNLS DNNPQTLDSI VQSFNSPSPV IPHSELARKR RQKLSDKTRC LQKLMPWDKK  240
MDTGTMLQEA YKYIRFLEAQ VSILQSMPIS SSFASTEHNA PVGFDYGGLG RLNRQQLLQV  300
LVNSPVAQTM LYSQGFCVFA YEQLVSLKKA KERKAVLQQF LFGN*
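The Signature Domain coordinates (Start 213, End 258) can be checked against the sequence block above. The sketch below is a hedged illustration, not a PlantTFDB tool: it strips the spacing, position counters, and the trailing `*` from the displayed block, then slices the domain using 1-based inclusive coordinates. The helper names `clean_sequence` and `slice_domain` are hypothetical.

```python
import re

# Sequence block copied from the record above, including the position
# counters and the trailing "*" exactly as displayed on the page.
RAW_BLOCK = """
MCVSLKILLY SRFHFCYTEH RCYVAFCTFP LSSLNFHCYH FSVFDINSMK MERGFGSTHP  60
CYPPAFTAEF SINSSYEANF NSIFNDPNLQ PLLPLPSDPS TFNFFSQDFP SLPLEPQLPV  120
PDLDSLYSSS LPTKIPDILP DSTQFLDFFN KPLPDLHSLE QPHRQPHFTE PSVSSSTRKL  180
KRTRLDLNLS DNNPQTLDSI VQSFNSPSPV IPHSELARKR RQKLSDKTRC LQKLMPWDKK  240
MDTGTMLQEA YKYIRFLEAQ VSILQSMPIS SSFASTEHNA PVGFDYGGLG RLNRQQLLQV  300
LVNSPVAQTM LYSQGFCVFA YEQLVSLKKA KERKAVLQQF LFGN*
"""


def clean_sequence(block: str) -> str:
    """Keep only residue letters: drops whitespace, digits, and the '*' marker."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, with 1-based inclusive coordinates."""
    return seq[start - 1:end]


protein = clean_sequence(RAW_BLOCK)
hlh = slice_domain(protein, 213, 258)  # coordinates from the Signature Domain table
print(hlh)
```

The extracted residues should match the `Thecc1EG007802t1` row of the HLH alignment once its gap characters (`-`) are removed.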
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_021298831.1  0.0      transcription factor bHLH117
TrEMBL  A0A061E4F6      0.0      A0A061E4F6_THECC; DNA binding-like protein
STRING  EOX99201        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM133992228
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G22100.1  2e-21    bHLH family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]