PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG000513t1
Common NameTCM_000513
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 421aa    MW: 48415.1 Da    PI: 9.7725
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG000513t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B361.71.2e-1933120298
                       EEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EE CS
                B3   2 fkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvv 94 
                       fk++     ++ ++l++p+kf++++ ++   s t+ l+ +sg  W v++  ++  +++v+++GW++Fvk++ L + DfvvF++dg+s+f  +v
  Thecc1EG000513t1  33 FKIM--IGDFR-NQLRIPRKFMSNFREN--LSGTVYLRGPSGFMWAVEV--ERMFDEVVFGNGWQNFVKDHSLADADFVVFRYDGNSTF--NV 116
                       5555..44454.56********877755..788****************..**********************************9999..78 PP

                       EEE- CS
                B3  95 kvfr 98 
                        +f+
  Thecc1EG000513t1 117 VIFD 120
                       7776 PP

2B338.62e-12303385388
                       EE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSS CS
                B3   3 kvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrs 88 
                        v++p +v k   + +  k ++ h  +k + + l    ++ ++W v++    ++ r  +++GW +Fv +n+L+e D++vF+l+++ 
  Thecc1EG000513t1 303 IVMKPTHVCKAFTVNIREKWLDMHVPDKVR-IALLRVAPDEKRWPVRIMR--TKWRRGFARGWGKFVLDNNLEEHDVCVFELNEEG 385
                       577788888888888888888888444444.2333345999*******44..44444589*********************98753 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:2.40.330.103.7E-2624124IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019367.46E-2625124IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086314.52229122IPR003340B3 DNA binding domain
CDDcd100174.21E-1932120No hitNo description
PfamPF023621.1E-1632120IPR003340B3 DNA binding domain
SMARTSM010193.2E-1732122IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.102.3E-18292400IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.73E-16294400IPR015300DNA-binding pseudobarrel domain
CDDcd100172.18E-14299400No hitNo description
PROSITE profilePS5086310.997301402IPR003340B3 DNA binding domain
SMARTSM010192.7E-5301390IPR003340B3 DNA binding domain
PfamPF023621.0E-9302386IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 421 aa     Download sequence    Send to blast
MQSQGMAPKS QNCRMMEKHK YWKEFQSNRH QFFKIMIGDF RNQLRIPRKF MSNFRENLSG  60
TVYLRGPSGF MWAVEVERMF DEVVFGNGWQ NFVKDHSLAD ADFVVFRYDG NSTFNVVIFD  120
LSGCEREGSY FVKKHTSACS NGRCGFRRED GEGSEEVIDL DKVHENHMQK EKLTKGKGNT  180
ISKAVDTISQ LKATQEKHLR VGKSAIASGV IEILMDESEE SSGSATEASE DWSNNSVSLK  240
CKSKKKEVGC LTNDSGKMKR PLDLPGSYNL YFISNRRKIT EEEKQRPRRL AKQYSSTRPS  300
FSIVMKPTHV CKAFTVNIRE KWLDMHVPDK VRIALLRVAP DEKRWPVRIM RTKWRRGFAR  360
GWGKFVLDNN LEEHDVCVFE LNEEGQANKK SIGFNVVIFR VLDEIVPLTR FSNTQPNASD  420
*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4i1k_A2e-1527438122127B3 domain-containing transcription factor VRN1
4i1k_B2e-1527438122127B3 domain-containing transcription factor VRN1
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017969605.10.0PREDICTED: B3 domain-containing transcription factor VRN1
TrEMBLA0A061DHL10.0A0A061DHL1_THECC; DSBA oxidoreductase family protein isoform 1
STRINGEOX912660.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM2512634
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18990.12e-33B3 family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]