PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG006973t1
Common NameTCM_006973
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 475aa    MW: 54386.6 Da    PI: 9.8418
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG006973t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B353.35e-172032851299
                       HTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
                B3  12 ksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99 
                       +s+ l +p +f+++h ++k+ ++  +l+ +sg  W++++  +k++++++++ GW++F + + L ++Df+v++++g+++f   v++f+k
  Thecc1EG006973t1 203 ASRSLEIPPHFVRTH-LPKRIPTRAVLRGPSGDYWKITM--CKQDRSTIMQHGWQQFYQNHCLGDKDFLVLRYNGNMCF--DVQIFEK 285
                       566799******999.57789999***************..**************************************..9999987 PP

2B351.42e-163834621499
                       T-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
                B3  14 grlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99 
                        +l++p +f+++h  +    ++ t+ +++ +sWev+l y  ++   v++kGW++F+ +n+L+  D++vF+l+    +e++v++f +
  Thecc1EG006973t1 383 FLLTIPTSFLNAHLPQ--ARTEFTFWTSKEKSWEVTLLY--TDTNKVFSKGWRRFAVDNKLEMDDSCVFELVA--PREMRVHIFPT 462
                       579*******999533..3459***************55..444468***********************986..67789999865 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019361.37E-20190288IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.102.2E-21191288IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086311.533192286IPR003340B3 DNA binding domain
SMARTSM010197.4E-13193286IPR003340B3 DNA binding domain
CDDcd100176.72E-16197283No hitNo description
PfamPF023628.1E-14204285IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.102.4E-24363461IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.69E-23364461IPR015300DNA-binding pseudobarrel domain
CDDcd100171.28E-17368461No hitNo description
PROSITE profilePS5086313.197370463IPR003340B3 DNA binding domain
SMARTSM010191.8E-13370463IPR003340B3 DNA binding domain
PfamPF023625.1E-14384461IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009553Biological Processembryo sac development
GO:0009567Biological Processdouble fertilization forming a zygote and endosperm
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 475 aa     Download sequence    Send to blast
MCFDVQIFEK SGCERNCVSI TRMHQDSTEK IKEEEMDVTD TVKTKSQVQI GGKAKKKSAE  60
EKDLASSSRK RPAGRKITAK LCKAAKNFTS NFPHFKHCIT RCNVEVPFQL TIPTSSSTQN  120
RIHLLDFERE ILGGLTLPSL LRMTFNLYDL ELLTSNCGTK KMQSHLESKK KMVKKSSKGV  180
ESNLPDHDNR PSFSMFILHK GDASRSLEIP PHFVRTHLPK RIPTRAVLRG PSGDYWKITM  240
CKQDRSTIMQ HGWQQFYQNH CLGDKDFLVL RYNGNMCFDV QIFEKSGCER NCVSITRTHQ  300
DSTEKIKEEE MDLTDMVKTK SQVQIGGKAK KKSAEEKDLA SSSRKRPAGR KITAKLCKAA  360
KDFTSNFPHF KHCITRCNVD IPFLLTIPTS FLNAHLPQAR TEFTFWTSKE KSWEVTLLYT  420
DTNKVFSKGW RRFAVDNKLE MDDSCVFELV APREMRVHIF PTERGNCSQL FVQV*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017970011.10.0PREDICTED: B3 domain-containing protein Os01g0723500
TrEMBLA0A061DZK90.0A0A061DZK9_THECC; Uncharacterized protein
STRINGEOX981450.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM56521135
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G18000.12e-26VERDANDI
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]