PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG034392t3
Common Name TCM_034392
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family MYB_related
Protein Properties Length: 710 aa    MW: 78246.2 Da    pI: 4.5019
Description MYB_related family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG034392t3  genome  CGD     View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  32.2   2.5e-10  556    599  1          46
                       TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
   Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                       r +WT+++ el++ a++q+G++ +++I + ++ gR++ q+k+++ +
  Thecc1EG034392t3 556 RAKWTKQDTELFYGAIRQFGPD-FSLIQQLFP-GRSRHQIKLKFKN 599
                       569*****************88.*********.**********987 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Pfam             PF15963           1.4E-26   549    631  No hit       No description
PROSITE profile  PS51293           10.747    554    605  IPR017884    SANT domain
SMART            SM00717           6.7E-9    555    603  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  1.9E-6    556    599  IPR009057    Homeodomain-like
SuperFamily      SSF46689          3.36E-10  558    606  IPR009057    Homeodomain-like
CDD              cd00167           3.80E-7   558    599  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 710 aa
MDDFDIFSDK SLVTQARAGA KFQPRAKLKS RKENVASIPC NKKEEVVTQS LSSLDNRLTS  60
PRGLSVDTST TSGTEESLKT NYEDLSQIAV NKADDVGFID APRSDIPVTV NDHDSHSVPN  120
IWAKDVDFDL DPFDDVANQV TNNGRAGGKF EPKTRPKACI VDPDGVVDDC GSSEVMTSSQ  180
VAVTDSLLSE VAVSNGCHDS HSSFGRSVGE NADIFSGLEC LDQFLTQSSN NNGGIQIDDE  240
GTGAQEAGAF PDVETQDIMS GATIASDPST SEFPVNEELT NLTEASNPGV TLSGDFPSMP  300
GKLSSNSRKR EASPIPNPSQ KSKQSSASDG GNENGKATKR LRKQVTSPKL VDDHEDGTCN  360
DEGLATEPPT SSAIDEDRDD GDDDDEYNAE SAFSKRRTSR RSKKPMAENE KPPRKRKTAK  420
EKQVEKQKKV NEASDQPTEE QPRKFSHSTR RKRRFVDESL LHTPEDEIDF AKVALKDIIL  480
LADYKERIAK KEAKASKIPL TNQSTKNTFP EENAHNEESS IASEQDQGFT DDQMSGGAQS  540
SSFFLNYQSY MDKEPRAKWT KQDTELFYGA IRQFGPDFSL IQQLFPGRSR HQIKLKFKNE  600
ERRYPLRLSE ALASRTKDHS YFEKVIEQLQ QVAGQAERES TGDVSMDLTR EEAELTPEAN  660
GKATKAEQDE DEAVGDQQAD VAEDRSTFKS DETDDDDHED ILSSYRSAF*
Nucleic Localization Signal
NLS
No.  Start  End  Sequence
1    394    416  KRRTSRRSKKPMAENEKPPRKRK
2    410    416  KPPRKRK
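
The coordinates in the Signature Domain and NLS tables can be checked against the protein sequence with a short script. The following is a minimal Python sketch and not part of PlantTFDB: the helper names (clean_sequence, slice_1based, find_0based) are hypothetical, and the coordinate conventions are assumptions inferred from this record only. Under those assumptions, reading the signature domain span 556-599 as 1-based inclusive reproduces the ungapped Thecc1EG034392t3 alignment row, while the NLS Start and End values coincide with 0-based inclusive offsets of the two motifs.

```python
import re

# Sequence block as printed in this record: grouped residues, running
# position counters at the end of each line, and a trailing "*".
RAW_BLOCK = """
MDDFDIFSDK SLVTQARAGA KFQPRAKLKS RKENVASIPC NKKEEVVTQS LSSLDNRLTS  60
PRGLSVDTST TSGTEESLKT NYEDLSQIAV NKADDVGFID APRSDIPVTV NDHDSHSVPN  120
IWAKDVDFDL DPFDDVANQV TNNGRAGGKF EPKTRPKACI VDPDGVVDDC GSSEVMTSSQ  180
VAVTDSLLSE VAVSNGCHDS HSSFGRSVGE NADIFSGLEC LDQFLTQSSN NNGGIQIDDE  240
GTGAQEAGAF PDVETQDIMS GATIASDPST SEFPVNEELT NLTEASNPGV TLSGDFPSMP  300
GKLSSNSRKR EASPIPNPSQ KSKQSSASDG GNENGKATKR LRKQVTSPKL VDDHEDGTCN  360
DEGLATEPPT SSAIDEDRDD GDDDDEYNAE SAFSKRRTSR RSKKPMAENE KPPRKRKTAK  420
EKQVEKQKKV NEASDQPTEE QPRKFSHSTR RKRRFVDESL LHTPEDEIDF AKVALKDIIL  480
LADYKERIAK KEAKASKIPL TNQSTKNTFP EENAHNEESS IASEQDQGFT DDQMSGGAQS  540
SSFFLNYQSY MDKEPRAKWT KQDTELFYGA IRQFGPDFSL IQQLFPGRSR HQIKLKFKNE  600
ERRYPLRLSE ALASRTKDHS YFEKVIEQLQ QVAGQAERES TGDVSMDLTR EEAELTPEAN  660
GKATKAEQDE DEAVGDQQAD VAEDRSTFKS DETDDDDHED ILSSYRSAF*
"""


def clean_sequence(block):
    """Remove whitespace and position counters, then the trailing '*'."""
    letters_and_stop = re.sub(r"[^A-Za-z*]", "", block)
    return letters_and_stop.rstrip("*")


def slice_1based(seq, start, end):
    """Return residues start..end, treating both as 1-based and inclusive (assumption)."""
    return seq[start - 1:end]


def find_0based(seq, motif):
    """Return (start, end) of the first match as 0-based inclusive offsets, or None."""
    start = seq.find(motif)
    if start < 0:
        return None
    return (start, start + len(motif) - 1)


seq = clean_sequence(RAW_BLOCK)
domain = slice_1based(seq, 556, 599)
nls_motifs = ("KRRTSRRSKKPMAENEKPPRKRK", "KPPRKRK")
nls_spans = {motif: find_0based(seq, motif) for motif in nls_motifs}

print(len(seq))
print(domain)
print(nls_spans)
```

The cleaned string holds 709 residues, so the reported length of 710 aa appears to count the trailing "*". Treat that, like the coordinate conventions, as an observation about this record rather than a documented PlantTFDB rule.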
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_017981354.1  0.0      PREDICTED: transcription factor TFIIIB component B'' isoform X5
TrEMBL  A0A061FEE1      0.0      A0A061FEE1_THECC; Homeodomain superfamily protein isoform 3
STRING  EOY15261        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G39160.1  9e-44    MYB_related family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]