PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG000517t1
Common NameTCM_000517
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 483aa    MW: 54954.7 Da    PI: 9.5376
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG000517t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B344.72.5e-1458142999
                       HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
                B3   9 dvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99 
                        +l+ g l +p kf++++g+   ++    l+ ++ ++W+v+   +k++g+++l++GW+eF +   L+ g f+vF+++g  +f  +v +f++
  Thecc1EG000517t1  58 GTLRDGKLGIPTKFVKRYGNGMSSP--ALLRVPNDEVWKVEP--TKCDGKVWLKNGWQEFSNHYSLEYGHFLVFRYEGFCNF--HVVIFDR 142
                       5678899***********7665555..89999**********..********************************987888..9999998 PP

2B364.12.2e-20269368197
                       EEEE-..-HHHHT..T-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTE....EEE-TTHHHHHHHHT--TT-EEEEEE-SS CS
                B3   1 ffkvltpsdvlks..grlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgr....yvltkGWkeFvkangLkegDfvvFkldgr 87 
                       f+ v+ ps+v+ +   rl +p++f+ +h +k+ +  +++l +++g++W+v l yr++ gr      l++GWk Fvk+n+++ gD++vF+l++ 
  Thecc1EG000517t1 269 FLLVMQPSYVGLNgkWRLAIPNNFVWKHLMKEDC--EVILCNSNGKTWTVSL-YRRGNGRellyAGLQTGWKTFVKDNNIQIGDVCVFELINC 358
                       677889999998855589**********877555..8***************.555555548777789*********************9976 PP

                       SEE..EEEEE CS
                B3  88 sefelvvkvf 97 
                       +e +++v+++
  Thecc1EG000517t1 359 MEISFKVTIY 368
                       7777777776 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:2.40.330.102.4E-2246143IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.94E-2247143IPR015300DNA-binding pseudobarrel domain
CDDcd100173.82E-1849141No hitNo description
PROSITE profilePS5086311.94250143IPR003340B3 DNA binding domain
SMARTSM010198.5E-1751143IPR003340B3 DNA binding domain
PfamPF023622.1E-1157142IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.106.7E-27261369IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.39E-25262368IPR015300DNA-binding pseudobarrel domain
CDDcd100177.48E-24267368No hitNo description
PfamPF023628.1E-18269368IPR003340B3 DNA binding domain
SMARTSM010193.1E-18269371IPR003340B3 DNA binding domain
PROSITE profilePS5086313.86271371IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 483 aa     Download sequence    Send to blast
MGPNLRPNGT VFRGIKLEIA KENYNLQRSM QMVILQKSND HPMFISESPH FFTIILPGTL  60
RDGKLGIPTK FVKRYGNGMS SPALLRVPND EVWKVEPTKC DGKVWLKNGW QEFSNHYSLE  120
YGHFLVFRYE GFCNFHVVIF DRSASEIEYP YGSNNHRQHK ELPEQKIEES EDADSLQILE  180
DISPSRKTGE KSHLPCSRPH KMMRSANSAN KTESNLKCES LAPHFRHNGS PDRKADKSTT  240
SHRIKKLNAD KKAKALQRAR AFKSENPFFL LVMQPSYVGL NGKWRLAIPN NFVWKHLMKE  300
DCEVILCNSN GKTWTVSLYR RGNGRELLYA GLQTGWKTFV KDNNIQIGDV CVFELINCME  360
ISFKVTIYQG QTDVFRPVKR NVGASSTIQG DMSSLHNQHP TEEFPVPKIE ENKRNASGEI  420
LDDILLYTGR KINISNPRNL PCTYPPICAS LVSSTTAKGM PGIWQSGSDL DQSIRRHIGF  480
YL*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4i1k_A3e-1223936919141B3 domain-containing transcription factor VRN1
4i1k_B3e-1223936919141B3 domain-containing transcription factor VRN1
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017972198.10.0PREDICTED: B3 domain-containing transcription factor VRN1
TrEMBLA0A061DHL40.0A0A061DHL4_THECC; AP2/B3-like transcriptional factor family protein, putative
STRINGEOX912710.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM14219723
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18990.12e-39B3 family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]