PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG006967t1
Common Name TCM_006967
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 445aa    MW: 51496.7 Da    PI: 7.3455
Description B3 family protein
Gene Model
Gene Model ID | Type | Source | Coding Sequence
Thecc1EG006967t1 | genome | CGD | View CDS
Signature Domain
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | B3 | 51.2 | 2.3e-16 | 21 | 112 | 1 | 99
                       EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..E CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelv 93 
                       ffk+ +p    +++ l++p+kf+ ++g++   s   t+  ++gr+W++ l  +k ++r++l +GW eF + n ++ g f+vF+++++s+f  +
  Thecc1EG006967t1  21 FFKIILPRTIAEKN-LTIPNKFVGKFGHE--LSGVATFVLPNGRKWKIGL--TKADDRIWLDDGWHEFIEYNSIRYGYFLVFRYEKDSTF--H 106
                       78888888888887.********999965..77799**************..********************************998999..9 PP

                       EEEE-S CS
                B3  94 vkvfrk 99 
                       v +f++
  Thecc1EG006967t1 107 VIIFDN 112
                       999987 PP

2 | B3 | 46.1 | 8.6e-15 | 355 | 440 | 10 | 98
                       HHHTT-EE--HHH.HTT.---..--SEEEEEETTS-EEEEEE..EEETTE.EEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE- CS
                B3  10 vlksgrlvlpkkfaeeh.ggkkeesktltledesgrsWevkliyrkksgr.yvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfr 98 
                       ++ s+ + +p  f +e+ +g++     +++ed++gr+W +++  +++++  ++l+kG  +F ++n+LkegD+++F+l +++ +  ++++fr
  Thecc1EG006967t1 355 NFDSSTMHVPAGFDTEYlSGVT-D--HIRVEDSDGREWFIEF--KRHRNFgILLRKGCYRFWRDNNLKEGDVCIFELRDKKAAVHKISIFR 440
                       6677889999999999976664.3..5***************..9999989*************************987688877777777 PP

Protein Features
3D Structure
Database | Entry ID | E-value | Start | End | InterPro ID | Description
Gene3D | G3DSA:2.40.330.10 | 2.3E-24 | 18 | 115 | IPR015300 | DNA-binding pseudobarrel domain
SuperFamily | SSF101936 | 2.16E-23 | 18 | 115 | IPR015300 | DNA-binding pseudobarrel domain
PROSITE profile | PS50863 | 13.62 | 20 | 113 | IPR003340 | B3 DNA binding domain
CDD | cd10017 | 2.45E-16 | 20 | 111 | No hit | No description
SMART | SM01019 | 4.0E-15 | 21 | 113 | IPR003340 | B3 DNA binding domain
Pfam | PF02362 | 4.1E-13 | 21 | 112 | IPR003340 | B3 DNA binding domain
Gene3D | G3DSA:2.40.330.10 | 2.6E-17 | 340 | 442 | IPR015300 | DNA-binding pseudobarrel domain
SuperFamily | SSF101936 | 6.47E-18 | 340 | 441 | IPR015300 | DNA-binding pseudobarrel domain
CDD | cd10017 | 2.91E-12 | 345 | 440 | No hit | No description
PROSITE profile | PS50863 | 11.448 | 346 | 442 | IPR003340 | B3 DNA binding domain
SMART | SM01019 | 7.8E-12 | 347 | 442 | IPR003340 | B3 DNA binding domain
Pfam | PF02362 | 5.0E-13 | 352 | 441 | IPR003340 | B3 DNA binding domain
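The twelve database hits in this table cluster around two stretches of the protein. As a minimal sketch (the helper name `merge_intervals` and the `HITS` list are illustrative, not part of PlantTFDB), merging the overlapping Start/End ranges as read from the table shows the two regions that the individual hits jointly cover:

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) ranges; coordinates are inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# (Start, End) of every Protein Features hit, as read from this record.
HITS = [
    (18, 115), (18, 115), (20, 113), (20, 111), (21, 113), (21, 112),
    (340, 442), (340, 441), (345, 440), (346, 442), (347, 442), (352, 441),
]

REGIONS = merge_intervals(HITS)
print(REGIONS)  # [(18, 115), (340, 442)]
```

The two merged regions bracket the two B3 domains reported under Signature Domain (21-112 and 355-440).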
Gene Ontology
GO Term | GO Category | GO Description
GO:0006355 | Biological Process | regulation of transcription, DNA-templated
GO:0005634 | Cellular Component | nucleus
GO:0003677 | Molecular Function | DNA binding
Sequence
Protein Sequence    Length: 445 aa
MAHRTRKRPR SNHVVDESLH FFKIILPRTI AEKNLTIPNK FVGKFGHELS GVATFVLPNG  60
RKWKIGLTKA DDRIWLDDGW HEFIEYNSIR YGYFLVFRYE KDSTFHVIIF DNSACEVDYP  120
SNDEELTDGE SEKHSIHQDS ETEEDDSPEV LGVKAPSLNE RANREAINGN QDASSQQHLV  180
RKPQVDAASH KNSEVKQDKV TRVKRCKTEE VEFDNLNESR QINSNIKELR RSHRLLPSKS  240
HLMNQNATGI HDQDLSVQLQ DLKQQFDGKK LKITIQRANL QSLEVMHKGN EANKETESGK  300
QDQHGSIQDE EIEIYVSRMF FGISSTSRDR ERAIRAVEVI KPKNPCFMII LRRGNFDSST  360
MHVPAGFDTE YLSGVTDHIR VEDSDGREWF IEFKRHRNFG ILLRKGCYRF WRDNNLKEGD  420
VCIFELRDKK AAVHKISIFR ADSN*
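The domain coordinates listed under Signature Domain line up with this sequence when read as 1-based, inclusive positions. The following is a rough sketch of how the two B3 segments could be cut out of the block above; the names `clean_sequence`, `domain`, and `B3_DOMAINS` are illustrative assumptions rather than anything provided by PlantTFDB.

```python
import re

# Sequence block copied from this record, including the grouping spaces,
# running position numbers, and the trailing stop symbol.
RAW = """
MAHRTRKRPR SNHVVDESLH FFKIILPRTI AEKNLTIPNK FVGKFGHELS GVATFVLPNG  60
RKWKIGLTKA DDRIWLDDGW HEFIEYNSIR YGYFLVFRYE KDSTFHVIIF DNSACEVDYP  120
SNDEELTDGE SEKHSIHQDS ETEEDDSPEV LGVKAPSLNE RANREAINGN QDASSQQHLV  180
RKPQVDAASH KNSEVKQDKV TRVKRCKTEE VEFDNLNESR QINSNIKELR RSHRLLPSKS  240
HLMNQNATGI HDQDLSVQLQ DLKQQFDGKK LKITIQRANL QSLEVMHKGN EANKETESGK  300
QDQHGSIQDE EIEIYVSRMF FGISSTSRDR ERAIRAVEVI KPKNPCFMII LRRGNFDSST  360
MHVPAGFDTE YLSGVTDHIR VEDSDGREWF IEFKRHRNFG ILLRKGCYRF WRDNNLKEGD  420
VCIFELRDKK AAVHKISIFR ADSN*
"""


def clean_sequence(raw: str) -> str:
    """Keep letters only: drops whitespace, position numbers, and '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    return seq[start - 1:end]


SEQ = clean_sequence(RAW)

# (start, end) pairs taken from the Signature Domain table of this record.
B3_DOMAINS = [(21, 112), (355, 440)]

for start, end in B3_DOMAINS:
    sub = domain(SEQ, start, end)
    print(f"B3 {start}-{end} ({len(sub)} aa): {sub[:10]}...{sub[-6:]}")
```

The first and last residues of each slice can be compared against the query rows of the alignments above (`FFKIILPRTI...VIIFDN` and `NFDSSTMHVP...KISIFR`) as a quick check that the coordinates were applied correctly.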
3D Structure
Structure
PDB ID | E-value | Query Start | Query End | Hit Start | Hit End | Description
4i1k_A | 2e-17 | 326 | 442 | 28 | 143 | B3 domain-containing transcription factor VRN1
4i1k_B | 2e-17 | 326 | 442 | 28 | 143 | B3 domain-containing transcription factor VRN1
Regulation -- PlantRegMap
Source | Upstream Regulator | Target Gene
PlantRegMap | Retrieve | -
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_007042301.2 | 0.0 | PREDICTED: B3 domain-containing protein REM21
TrEMBL | A0A061E0I2 | 0.0 | A0A061E0I2_THECC; Uncharacterized protein
STRING | EOX98132 | 0.0 | (Theobroma cacao)
Orthologous Group
Lineage | Orthologous Group ID | Taxa Number | Gene Number
MalvidsOGEM15912578
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT4G01580.1 | 6e-29 | B3 family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]