PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG006970t1
Common NameTCM_006970
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 466aa    MW: 53888.9 Da    PI: 8.6482
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG006970t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B353.83.5e-1737128199
                       EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..E CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelv 93 
                       ffk+ +p+   +++ l +pkkf+ ++g++   s   t+  ++gr+W++ l  +k ++r++l +GW eFv+ n ++ g f++F+++++s+f  +
  Thecc1EG006970t1  37 FFKIILPHTIAEKK-LKIPKKFVGKFGHE--LSSLATFVLPNGRKWKIGL--TKADDRIWLDDGWHEFVEYNSIRYGYFLIFRYERNSTF--H 122
                       77777676666665.********999965..7889***************..********************************999999..9 PP

                       EEEE-S CS
                B3  94 vkvfrk 99 
                       v +f++
  Thecc1EG006970t1 123 VIIFDN 128
                       999987 PP

2B350.34.3e-163774611198
                       HHTT-EE--HHH.HTT.---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE- CS
                B3  11 lksgrlvlpkkfaeeh.ggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfr 98 
                       ++s+ + +p  fa+e+ +g++     +++ed++gr+W ++l+ r++ ++ +l+kG  +F ++n+LkegD++ F+l +++ +  ++++fr
  Thecc1EG006970t1 377 FNSSTMPVPAGFAAEYlSGVT-D--HIRVEDSDGREWFIELR-RQNNCTLILRKGCHRFWRDNNLKEGDVCFFELRDKKAAVHKISIFR 461
                       6677899*********86664.3..5***************5.666678*************************987688877777777 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:2.40.330.109.5E-2632131IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.96E-2433131IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086314.00136129IPR003340B3 DNA binding domain
CDDcd100171.27E-1736127No hitNo description
SMARTSM010197.7E-1737129IPR003340B3 DNA binding domain
PfamPF023621.2E-1337128IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.107.5E-19361463IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019363.73E-18361462IPR015300DNA-binding pseudobarrel domain
CDDcd100174.44E-14366461No hitNo description
PROSITE profilePS5086311.871367463IPR003340B3 DNA binding domain
SMARTSM010191.0E-10368463IPR003340B3 DNA binding domain
PfamPF023621.2E-14376462IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 466 aa     Download sequence    Send to blast
MHVIQDIFFE GLNKLMAHRT RKRPRSNHVV EEESLHFFKI ILPHTIAEKK LKIPKKFVGK  60
FGHELSSLAT FVLPNGRKWK IGLTKADDRI WLDDGWHEFV EYNSIRYGYF LIFRYERNST  120
FHVIIFDNSA CEVDYPSYVP SNDEELNDGE SEKHSIHQDS ETEEDDSPEF LGVKAPNLDE  180
RTNREAIISG KQAASSQQRL VRKPQGDAAS HKNAEVKQAK VTGVRRCKTE EVEFDHLNES  240
RQINSNIKEL RRSHRLLPSK SHLMNQNATG IHDQDLSVQL RDLKQQFDGK KLKITIQRAN  300
LQSLEVMHKG NEANKKTESG KQGQHGSIQD EETEIYVSRM FFGISSTSRD RERAIRAVEV  360
IKPMNPCFMI ILRRCHFNSS TMPVPAGFAA EYLSGVTDHI RVEDSDGREW FIELRRQNNC  420
TLILRKGCHR FWRDNNLKEG DVCFFELRDK KAAVHKISIF RADSN*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4i1k_A2e-1634746328143B3 domain-containing transcription factor VRN1
4i1k_B2e-1634746328143B3 domain-containing transcription factor VRN1
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007042308.20.0PREDICTED: B3 domain-containing protein Os03g0619600
TrEMBLA0A061DZE90.0A0A061DZE9_THECC; Uncharacterized protein
STRINGEOX981390.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM15912578
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18990.13e-30B3 family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]