PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG041572t1
Common NameTCM_041572
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family bHLH
Protein Properties Length: 732aa    MW: 83550.2 Da    PI: 4.7565
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG041572t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
Sequence ? help Back to Top
Protein Sequence    Length: 732 aa     Download sequence    Send to blast
MSPNESTQQL NSQRQIDVEV EAGSVARNED SICSWRPFSP VMSPNEMNHS LNSIFENLIA  60
EQQLKGPENF EVETSSVAQT KDFSPIWISF VTDMLPPGEG KPNEVPQQIN SPDQQLNSLA  120
ENLTADLRSN SQRDVEVGLR CLQPNRSHLD HEERPSDQAS NGRKQRGRKP RVTPVDTAKI  180
QLDQAGASAE ACNGQKRIRT EEQENERKRK KNESDRKYRA DVRNELKELR KIKPVYDTLM  240
TIASSFGGID QLESLINGMN SDIHKLQEKE VEYDMFQQGI EIPDRVPSMR ANEVQVCGIE  300
EMKSLSDKFK EMQAECHRLQ LLKSKYGEIE DIESMLDKFK NMEAESQRLE HIKMLFGGID  360
EIELEIRRLQ NMELQLERHK QMVNQKELDS FQASPGSLQQ LELDHCYRLQ KEADLDRFEQ  420
LKSKFGGTEE LEFKLDKFHW MEAELHKLEQ IKSEFGGVDE MEAEIYRLKE IESQHDKQKE  480
LQFFPESPGS LQEELGAQSL DLNGSSDAVS ADGISLMSPA AVRGTNTMHD MQYSDVLVTK  540
FMAKLDDDSV VGNVDRSSFK DLDGEPKKVG QYCLPPSLVS TAEDIIKAYG DITKKCKFSP  600
RIIEDIYVLF CAAIKEMGDL SLEQVTEEVM LKWRDAITDA DRSACDVEFA MKHLEKIAYG  660
YFGLKAYNDR NSLKQRMTIL KAEEEVLRKE LEKKANEMKA VKAKEGDLTS KRCKVCQEFA  720
DQFLDKTISV F*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1206210RKRKK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007016050.20.0PREDICTED: uncharacterized protein LOC18590454 isoform X1
TrEMBLA0A061GZV00.0A0A061GZV0_THECC; Uncharacterized protein isoform 1
STRINGEOY336690.0(Theobroma cacao)
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]