PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG006242t1
Common NameTCM_006242
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 265aa    MW: 29849 Da    PI: 9.8533
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG006242t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B335.22.2e-11691387
                      EE-..-HHHHT.T-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTE..EEE-TTHHHHHHHHT--TT-EEEEEE-SS CS
                B3  3 kvltpsdvlks.grlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgr..yvltkGWkeFvkangLkegDfvvFkldgr 87
                      kvlt+ dv k+ +  ++ kk++ ++g+k   +++ ++ed++g +W + ++ rk+++    vl+kGW +Fv+  +L  gD vv++ ++ 
  Thecc1EG006242t1  6 KVLTKTDVQKRlSVRTVNKKCFLDFGNK--HKVEFKVEDKNGDVWPFVCSTRKGQDYpkPVLSKGWLRFVRRWKLAIGDRVVLHEIQG 91
                      7888888888855557778877777766..5689***************77666666999**********************995543 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019361.02E-12489IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.102.2E-13490IPR015300DNA-binding pseudobarrel domain
SMARTSM010190.00254106IPR003340B3 DNA binding domain
PfamPF023621.6E-8691IPR003340B3 DNA binding domain
CDDcd100171.36E-136104No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 265 aa     Download sequence    Send to blast
MAVLSKVLTK TDVQKRLSVR TVNKKCFLDF GNKHKVEFKV EDKNGDVWPF VCSTRKGQDY  60
PKPVLSKGWL RFVRRWKLAI GDRVVLHEIQ GKAGTGLYRI EVIKRAKQSP GVLSPSILNH  120
DGDRSMGNIG KEPTGTTHST DQAMAYNQTD GPRDQTVSTV TSHSTDQAMA YKQTEGLRDQ  180
PVSAVTFHST DQAMAYNQTE RLNDLPVTDR VGSTMVEFIC LKPRVQVKEP KFIDFFELES  240
QDRKGKDMVE SPSTSLTTRF FEFL*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
TrEMBLA0A061E4G00.0A0A061E4G0_THECC; Uncharacterized protein
STRINGEOX971430.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM1783547
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G01500.17e-07B3 family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]