PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG042467t1
Common NameTCM_042467
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family Trihelix
Protein Properties Length: 242aa    MW: 27521.2 Da    PI: 10.2074
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG042467t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix43.39.8e-1423106186
          trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm..rergf..erspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                       +W++  + +Liea  ++  rl+rg+l++++W+ev++++  r+ g   +++ +qCk++++ l+k+yk +++          s++p+f++++
  Thecc1EG042467t1  23 CWSEGATGTLIEAWGDRYLRLNRGNLRQKDWQEVADAVnsRQNGAkpRKTDVQCKNRIDTLKKKYKLERAKPPP------SKWPFFKRID 106
                       7*************************************885555544779*****************9998886......58*****997 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138371.4E-2621108No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 242 aa     Download sequence    Send to blast
MNLRGLPHIT HSSGGSGGGR EDCWSEGATG TLIEAWGDRY LRLNRGNLRQ KDWQEVADAV  60
NSRQNGAKPR KTDVQCKNRI DTLKKKYKLE RAKPPPSKWP FFKRIDSLIG ANASVNKKHS  120
AVTFTIKPKA TSFLKGPRST ESSFGDEDGD EDEVRKEHRV EDVGLSDGAA CRELARAILK  180
FGEIYERIES SKQQQMMELE KQRMEFTKDL EFQRMNMLVD AQLEMEKSKR QKHLTRSGKK  240
H*
Functional Description ? help Back to Top
Source Description
UniProtTranscription regulator that may repress the maturation program during early embryogenesis. {ECO:0000269|PubMed:21330492}.
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00488DAPTransfer from AT5G05550Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007008911.21e-180PREDICTED: trihelix transcription factor ASIL2
SwissprotQ9LJG84e-36ASIL2_ARATH; Trihelix transcription factor ASIL2
TrEMBLA0A061FKY61e-179A0A061FKY6_THECC; Sequence-specific DNA binding transcription factors
STRINGEOY177211e-180(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM49262141
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G05550.11e-72sequence-specific DNA binding transcription factors
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]