PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG000597t2
Common Name TCM_000597
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 521 aa    MW: 58836.1 Da    pI: 8.3892
Description B3 family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG000597t2  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    B3      60     4.1e-19  127    217  1          99
                       EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..E CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelv 93 
                       f k l++s+v++  ++ lp  f++ h ++ +++++++ledesg ++ vk+    +s+++ l++GW++F++ ++L egD+++F+l++ ++f  +
  Thecc1EG000597t2 127 FAKSLVRSHVGSCFWMGLPGMFCKIH-LP-RKDTMIILEDESGHQFHVKY----YSDKTGLSAGWRQFCSVHNLLEGDVLIFQLVEPTKF--K 211
                       889********************666.44.5788****************....555556***********************9876666..9 PP

                       EEEE-S CS
                B3  94 vkvfrk 99 
                       v ++r+
  Thecc1EG000597t2 212 VYIIRA 217
                       999886 PP

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:2.40.330.10  9.7E-22   119    218  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          1.16E-23  120    218  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            4.08E-26  125    216  No hit       No description
PROSITE profile  PS50863            13.366    127    218  IPR003340    B3 DNA binding domain
Pfam             PF02362            9.9E-17   127    217  IPR003340    B3 DNA binding domain
SMART            SM01019            7.6E-17   127    218  IPR003340    B3 DNA binding domain
Pfam             PF05266            3.3E-7    395    506  IPR007930    Protein of unknown function DUF724
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 521 aa
MDQRVKKEAE EIHQRTMSFA GRRLKSTGEE DLTLAQLSHT PKLKPSSSGK KKKKKKKDKA  60
LTERKEKQKS QSESRIKHEV SGPGGKHMSS KKNKGAGDGK CTPEIKSPAM IRAEEVQSNL  120
EPEFPSFAKS LVRSHVGSCF WMGLPGMFCK IHLPRKDTMI ILEDESGHQF HVKYYSDKTG  180
LSAGWRQFCS VHNLLEGDVL IFQLVEPTKF KVYIIRANDL TELDGALGLL NLDAHTKQSD  240
AGKLLVWYLE YYACISDGLH YNRIFEKKKK EDAEIGITAC KSTKRKRPKS LPLAVVQKKN  300
KRSGLQTNLG QPAEQSENDS EEVGSEVLEG FKLSVPTIQF KDITSFENFS ILVGDLVIDS  360
ELSEDIRNKY FKLCCSQNSF LHENIIQGIN FKLIVGTISE TVNIADAIRA CKLTTSQDEF  420
DSWDKTLKAF DLLGMNVGFL RTRLRRLVNL AFESEGAADA RRYVEAKMER AQTEDEIRNL  480
EAKLAELKDT CKTFGVEIES LQSQAETYEL RFEEEVQASW *
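
The sequence block above is grouped in tens with a running position counter at the end of each line, so it has to be flattened before it can be passed to other tools. A minimal sketch of that clean-up, assuming the block is available as plain text; the function name `parse_sequence_block` is hypothetical, and only the first two lines of the record are used as sample input:

```python
import re


def parse_sequence_block(block: str) -> str:
    """Flatten a numbered, space-grouped sequence block into one string.

    Assumption: everything that is not a letter (spaces, position
    counters, the trailing '*' marker) is layout and can be dropped.
    """
    parts = []
    for line in block.splitlines():
        parts.append(re.sub(r"[^A-Za-z]", "", line))
    return "".join(parts).upper()


# First two lines of the protein sequence listed in this record.
sample = """\
MDQRVKKEAE EIHQRTMSFA GRRLKSTGEE DLTLAQLSHT PKLKPSSSGK KKKKKKKDKA  60
LTERKEKQKS QSESRIKHEV SGPGGKHMSS KKNKGAGDGK CTPEIKSPAM IRAEEVQSNL  120
"""

seq = parse_sequence_block(sample)
print(len(seq))  # number of residues recovered from the two sample lines
```

Applied to the full block, the same function also drops the final `*`, so the resulting length may differ by one from a length that counts that marker.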
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    50     56   KKKKKKK
2    267    286  KKKEDAEIGITACKSTKRKR
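
The Start and End columns can be checked against the protein sequence by slicing. The sketch below assumes 1-based, inclusive coordinates, which this page does not state; the helper name `slice_1based` is hypothetical, and only the first 60 residues of the record are used, which is enough to cover entry 1:

```python
def slice_1based(seq: str, start: int, end: int) -> str:
    """Return seq[start..end], treating both bounds as 1-based and inclusive.

    Assumption: the coordinate convention is a guess, not something the
    record documents.
    """
    if start < 1 or end > len(seq) or start > end:
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# First 60 residues of the protein sequence listed in this record.
first60 = (
    "MDQRVKKEAE EIHQRTMSFA GRRLKSTGEE "
    "DLTLAQLSHT PKLKPSSSGK KKKKKKKDKA"
).replace(" ", "")

nls1 = slice_1based(first60, 50, 56)
print(nls1)
```

Other entries can be checked the same way against the full sequence. If a slice comes out shifted by one residue relative to the listed Sequence column, the table may be using a different indexing convention than the one assumed here.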
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JX582557  7e-36    JX582557.1 Gossypium hirsutum clone NBRI_GE15408 microsatellite sequence.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007047227.2  0.0      PREDICTED: B3 domain-containing protein Os01g0234100
TrEMBL  A0A061DGD2      0.0      A0A061DGD2_THECC; AP2/B3-like transcriptional factor family protein isoform 2
STRING  EOX91385        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM16837  10  12
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G19184.1  2e-34    B3 family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]