PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG000597t4
Common Name TCM_000597
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 489aa    MW: 54959.7 Da    PI: 8.3161
Description B3 family protein
Gene Model
Gene Model ID       Type      Source    Coding Sequence
Thecc1EG000597t4    genome    CGD       View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1      B3        60.1     3.7e-19    127      217    1            99
                       EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..E CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelv 93 
                       f k l++s+v++  ++ lp  f++ h ++ +++++++ledesg ++ vk+    +s+++ l++GW++F++ ++L egD+++F+l++ ++f  +
  Thecc1EG000597t4 127 FAKSLVRSHVGSCFWMGLPGMFCKIH-LP-RKDTMIILEDESGHQFHVKY----YSDKTGLSAGWRQFCSVHNLLEGDVLIFQLVEPTKF--K 211
                       889********************666.44.5788****************....555556***********************9876666..9 PP

                       EEEE-S CS
                B3  94 vkvfrk 99 
                       v ++r+
  Thecc1EG000597t4 212 VYIIRA 217
                       999886 PP

Protein Features
3D Structure
Database           Entry ID             E-value     Start    End    InterPro ID    Description
Gene3D             G3DSA:2.40.330.10    8.7E-22     119      218    IPR015300      DNA-binding pseudobarrel domain
SuperFamily        SSF101936            1.04E-23    120      218    IPR015300      DNA-binding pseudobarrel domain
CDD                cd10017              1.72E-26    125      216    No hit         No description
Pfam               PF02362              8.9E-17     127      217    IPR003340      B3 DNA binding domain
PROSITE profile    PS50863              13.366      127      218    IPR003340      B3 DNA binding domain
SMART              SM01019              7.6E-17     127      218    IPR003340      B3 DNA binding domain
Pfam               PF05266              2.9E-7      363      474    IPR007930      Protein of unknown function DUF724
Gene Ontology
GO Term       GO Category           GO Description
GO:0006355    Biological Process    regulation of transcription, DNA-templated
GO:0005634    Cellular Component    nucleus
GO:0003677    Molecular Function    DNA binding
Sequence
Protein Sequence    Length: 489 aa
MDQRVKKEAE EIHQRTMSFA GRRLKSTGEE DLTLAQLSHT PKLKPSSSGK KKKKKKKDKA  60
LTERKEKQKS QSESRIKHEV SGPGGKHMSS KKNKGAGDGK CTPEIKSPAM IRAEEVQSNL  120
EPEFPSFAKS LVRSHVGSCF WMGLPGMFCK IHLPRKDTMI ILEDESGHQF HVKYYSDKTG  180
LSAGWRQFCS VHNLLEGDVL IFQLVEPTKF KVYIIRANDL TELDGALGLL NLDAHTKQSD  240
AEIGITACKS TKRKRPKSLP LAVVQKKNKR SGLQTNLGQP AEQSENDSEE VGSEVLEGFK  300
LSVPTIQFKD ITSFENFSIL VGDLVIDSEL SEDIRNKYFK LCCSQNSFLH ENIIQGINFK  360
LIVGTISETV NIADAIRACK LTTSQDEFDS WDKTLKAFDL LGMNVGFLRT RLRRLVNLAF  420
ESEGAADARR YVEAKMERAQ TEDEIRNLEA KLAELKDTCK TFGVEIESLQ SQAETYELRF  480
EEEVQASW*
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      50       56     KKKKKKK
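The Start and End values in the Signature Domain and NLS tables above read as 1-based, inclusive positions in the protein sequence. The following is a minimal sketch of how those coordinates can be checked against the sequence block as displayed in this record. The helper names `clean_sequence` and `region` are illustrative and not part of any PlantTFDB tool.

```python
import re

# Sequence block copied from this record: 10-residue groups,
# running position counters, and a trailing '*' stop marker.
RAW = """
MDQRVKKEAE EIHQRTMSFA GRRLKSTGEE DLTLAQLSHT PKLKPSSSGK KKKKKKKDKA  60
LTERKEKQKS QSESRIKHEV SGPGGKHMSS KKNKGAGDGK CTPEIKSPAM IRAEEVQSNL  120
EPEFPSFAKS LVRSHVGSCF WMGLPGMFCK IHLPRKDTMI ILEDESGHQF HVKYYSDKTG  180
LSAGWRQFCS VHNLLEGDVL IFQLVEPTKF KVYIIRANDL TELDGALGLL NLDAHTKQSD  240
AEIGITACKS TKRKRPKSLP LAVVQKKNKR SGLQTNLGQP AEQSENDSEE VGSEVLEGFK  300
LSVPTIQFKD ITSFENFSIL VGDLVIDSEL SEDIRNKYFK LCCSQNSFLH ENIIQGINFK  360
LIVGTISETV NIADAIRACK LTTSQDEFDS WDKTLKAFDL LGMNVGFLRT RLRRLVNLAF  420
ESEGAADARR YVEAKMERAQ TEDEIRNLEA KLAELKDTCK TFGVEIESLQ SQAETYELRF  480
EEEVQASW*
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace, position counters, and the stop marker; keep letters only."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based and inclusive."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
nls = region(seq, 50, 56)    # NLS row: Start 50, End 56
b3 = region(seq, 127, 217)   # B3 signature domain row: Start 127, End 217

print("NLS:", nls)
print("B3 domain:", b3)
```

Under this reading, the NLS slice should reproduce the Sequence column of the NLS table, and the B3 slice should begin and end with the residues shown on the Thecc1EG000597t4 rows of the alignment.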
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    JX582557    6e-36      JX582557.1 Gossypium hirsutum clone NBRI_GE15408 microsatellite sequence.
Annotation -- Protein
Source    Hit ID            E-value    Description
Refseq    XP_007047227.2    0.0        PREDICTED: B3 domain-containing protein Os01g0234100
TrEMBL    A0A061DGW8        0.0        A0A061DGW8_THECC; AP2/B3-like transcriptional factor family protein isoform 4
STRING    EOX91385          0.0        (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT3G19184.1    1e-34      B3 family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]