PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG033935t2
Common Name TCM_033935
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 525 aa    MW: 58693.2 Da    pI: 8.9195
Description B3 family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG033935t2  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    B3      47.2   3.9e-15  2      77   18         99
                      --HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
                B3 18 lpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99
                      +p  f ++ +g + +   + l+ +sg sW v+l   +k++  ++ +GW +Fv+++ ++ gDf+vF++dg+  f   vkvf++
  Thecc1EG033935t2  2 IPVGFNRNLEGRTSG--SVLLRGPSGYSWVVEL--VRKDDDLLFVEGWADFVRDHSVECGDFLVFRYDGDLVF--DVKVFDQ 77
                      788888777877555..6***************..9999****************************998777..9999987 PP

2    B3      52.9   6.6e-17  207    300  2          98
                       EEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE...EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..E CS
                B3   2 fkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy.rkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelv 93 
                       + ++++ +v+ s  l +p +fa eh   ++++++++l++++g  W+v+    ++   +++l +GW +Fv++n++k+gD+++F++++  +fel+
  Thecc1EG033935t2 207 VRIMKRFNVSGSYTLNIPYQFAMEHL--PKCKTEIVLRNLKGACWTVNSVPtTRVHTSHTLCGGWLGFVRSNEIKVGDICIFEFVR--KFELR 295
                       56667778888899********9996..34889***************954566666799***********************876..999** PP

                       EEEE- CS
                B3  94 vkvfr 98 
                       v+++r
  Thecc1EG033935t2 296 VHILR 300
                       **998 PP

3    B3      49.4   8.4e-16  414    503  4          96
                       E-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTE.EEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEE CS
                B3   4 vltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgr.yvltkGWkeFvkangLkegDfvvFkldgrsefelvvk 95 
                       ++ + +v+ s  l +p kf +++   + ++++++l++ +g+ W+v+   + k+++ +++ +GW  Fv++n++k gD+++F+l+++   e++v+
  Thecc1EG033935t2 414 IMRKFNVSGSYTLKIPYKFSKAYL--PYCKTEVVLRNMQGKWWTVNSVPDSKGRAvHTFCGGWMAFVRDNDIKMGDICIFELVNK--CEMYVH 502
                       566667788889********9995..457889***************6655555579999*********************9873..335777 PP

                       E CS
                B3  96 v 96 
                       +
  Thecc1EG033935t2 503 I 503
                       6 PP

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50863            15.58     1      78   IPR003340    B3 DNA binding domain
Pfam             PF02362            8.9E-12   2      77   IPR003340    B3 DNA binding domain
SuperFamily      SSF101936          1.67E-19  2      80   IPR015300    DNA-binding pseudobarrel domain
SMART            SM01019            0.0019    2      78   IPR003340    B3 DNA binding domain
CDD              cd10017            1.63E-14  2      76   No hit       No description
Gene3D           G3DSA:2.40.330.10  4.1E-18   2      80   IPR015300    DNA-binding pseudobarrel domain
Gene3D           G3DSA:2.40.330.10  3.9E-23   197    300  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          4.51E-25  199    301  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            1.03E-19  204    300  No hit       No description
Pfam             PF02362            1.9E-14   206    300  IPR003340    B3 DNA binding domain
SMART            SM01019            3.5E-14   206    302  IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            13.465    206    302  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  7.9E-23   403    505  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          6.67E-25  405    505  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            2.21E-20  409    503  No hit       No description
Pfam             PF02362            3.5E-13   411    503  IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            13.479    411    507  IPR003340    B3 DNA binding domain
SMART            SM01019            1.3E-10   411    507  IPR003340    B3 DNA binding domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 525 aa
KIPVGFNRNL EGRTSGSVLL RGPSGYSWVV ELVRKDDDLL FVEGWADFVR DHSVECGDFL  60
VFRYDGDLVF DVKVFDQSSC EKEAAFHCKC SQGGSEYDRI VGQKRGREDG GVSLDQDCEG  120
LVKRTRESSS EFDRDVADKE HCGYEPILAA KTRRGLASCD ENNEGTILKT SGTEDLNLHG  180
GGCTPMVAEF EEKKVAQSFN SSFPFFVRIM KRFNVSGSYT LNIPYQFAME HLPKCKTEIV  240
LRNLKGACWT VNSVPTTRVH TSHTLCGGWL GFVRSNEIKV GDICIFEFVR KFELRVHILR  300
VGGEDPDRQS GKAVSNVLIN RSDATLPIKF VKKSSKVHSK SMKKVQMCDN KGFKMLDKKK  360
YGNAAKKSAS VALCSLSRSG NEKQAIGGLR MMLALDEEKA AQSFASGFPS FVRIMRKFNV  420
SGSYTLKIPY KFSKAYLPYC KTEVVLRNMQ GKWWTVNSVP DSKGRAVHTF CGGWMAFVRD  480
NDIKMGDICI FELVNKCEMY VHISGSGRKG LDHQHASTEL LTLR*
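
The sequence block above is grouped in tens with a running position count at the end of each full line, and a trailing `*` marking the stop. A minimal sketch of turning such a block into a plain string and cutting out a domain by its 1-based, inclusive Start and End coordinates (as listed in the Signature Domain table) might look like the following. The function names `parse_sequence_block` and `domain_slice` are hypothetical helpers for illustration, not part of any PlantTFDB tooling, and only the first two sequence lines are used here to keep the example short.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a grouped, position-annotated sequence block into one string.

    Hypothetical helper: assumes each line holds residue groups separated
    by spaces, optionally followed by a running position count.
    """
    parts = []
    for line in text.splitlines():
        # Drop the trailing running position count (e.g. "  60") if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces between the 10-residue groups.
        parts.append(re.sub(r"\s+", "", line))
    # A trailing "*" marks the stop and is not a residue.
    return "".join(parts).rstrip("*")


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


# First two lines of the protein sequence from this record.
block = """KIPVGFNRNL EGRTSGSVLL RGPSGYSWVV ELVRKDDDLL FVEGWADFVR DHSVECGDFL  60
VFRYDGDLVF DVKVFDQSSC EKEAAFHCKC SQGGSEYDRI VGQKRGREDG GVSLDQDCEG  120"""

seq = parse_sequence_block(block)
# First B3 domain hit: Start 2, End 77.
first_b3 = domain_slice(seq, 2, 77)
print(first_b3)
```

The slice should agree with the query row of the first alignment once that row's gap characters (`-`) are removed, which is a quick way to check that the coordinates were read correctly.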
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007017394.1  0.0      PREDICTED: B3 domain-containing protein Os11g0197600
TrEMBL  A0A061FB96      0.0      A0A061FB96_THECC; AP2/B3-like transcriptional factor family protein, putative isoform 2 (Fragment)
STRING  EOY14619        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G33280.1  5e-17    B3 family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]