PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG015085t1
Common Name TCM_015085
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family ARR-B
Protein Properties Length: 681 aa    MW: 73690.7 Da    pI: 8.6751
Description ARR-B family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG015085t1  genome  CGD     View CDS
Signature Domain
No.  Domain        Score  E-value  Start  End  HMM Start  HMM End
1    G2-like       80.6   1.8e-25  232    285  1          55
           G2-like   1 kprlrWtpeLHerFveaveqLGGsekAtPktilelmkvkgLtlehvkSHLQkYRl 55 
                       k++++Wt+ LH++F+ea++q+ G +kA+Pk+ile+m+v+gLt+e+v+SHLQkYR+
  Thecc1EG015085t1 232 KAKVVWTNSLHNKFLEALRQI-GLDKAVPKKILEIMNVPGLTRENVASHLQKYRI 285
                       6799*****************.********************************6 PP

2    Response_reg  67.8   4.8e-23  64     156  16         109
                       HHHHHTTCEEEEEESSHHHHHHHHHHHH.ESEEEEESSCTTSEHHHHHHHHHHHTTTSEEEEEESTTTHHHHHHHHHTTESEEEESS--HHHH CS
      Response_reg  16 qalekegyeevaeaddgeealellkekd.pDlillDiempgmdGlellkeireeepklpiivvtahgeeedalealkaGakdflsKpfdpeel 107
                       ++l +  +  va+++++ +al +l++k  +Dl+++D+ mpgm+G+el ++i++e  ++p+i++++ ++e+ +l+ l+ Ga  f++Kp++p++l
  Thecc1EG015085t1  64 RMLVDIVM-MVATVKSPADALSTLRAKPgIDLVVTDLHMPGMNGIELQRQINREF-RVPVIIMSSDEQESVMLQSLEEGAVFFIAKPVKPDDL 154
                       56666667.9*****************99**********************9988.9***********************************9 PP

                       HH CS
      Response_reg 108 vk 109
                        +
  Thecc1EG015085t1 155 KN 156
                       86 PP
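The Start and End values in the domain table can be cross-checked against the alignment rows: once gap characters are removed, a query row should contain exactly End - Start + 1 residues. The following is a minimal Python sketch of that check, assuming each query row consists of four whitespace-separated fields (name, start, aligned residues, end) as shown above. The function name `check_alignment_row` is hypothetical and is not part of any PlantTFDB tool.

```python
def check_alignment_row(row: str) -> tuple[str, int, int, bool]:
    """Parse one query row of an HMMER-style alignment and check its coordinates.

    Returns (name, start, end, consistent), where `consistent` is True when the
    number of ungapped residues equals end - start + 1.
    """
    name, start_text, aligned, end_text = row.split()
    start, end = int(start_text), int(end_text)
    # '-' and '.' are treated as gap characters and do not count as residues.
    residues = aligned.replace("-", "").replace(".", "")
    return name, start, end, len(residues) == end - start + 1


# Query row of the G2-like alignment, copied from the record above.
g2_like_row = (
    "Thecc1EG015085t1 232 "
    "KAKVVWTNSLHNKFLEALRQI-GLDKAVPKKILEIMNVPGLTRENVASHLQKYRI 285"
)

name, start, end, consistent = check_alignment_row(g2_like_row)
print(name, start, end, consistent)  # Thecc1EG015085t1 232 285 True
```

The same function can be applied to each row of the Response_reg alignment, which is split across two blocks (64 to 154 and 155 to 156).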

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:3.40.50.2300  7.2E-31   19     162  No hit       No description
SMART            SM00448             3.0E-18   21     158  IPR001789    Signal transduction response regulator, receiver domain
PROSITE profile  PS50110             34.652    22     162  IPR001789    Signal transduction response regulator, receiver domain
CDD              cd00156             3.57E-24  24     162  No hit       No description
SuperFamily      SSF52172            1.57E-23  62     163  IPR011006    CheY-like superfamily
Pfam             PF00072             4.8E-19   65     156  IPR001789    Signal transduction response regulator, receiver domain
PROSITE profile  PS51294             12.049    229    288  IPR017930    Myb domain
SuperFamily      SSF46689            3.4E-19   230    289  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60    2.6E-26   230    289  IPR009057    Homeodomain-like
TIGRFAMs         TIGR01557           2.4E-25   232    285  IPR006447    Myb domain, plants
Pfam             PF00249             4.3E-6    235    284  IPR001005    SANT/Myb domain
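The hits in this table overlap heavily, so it can help to collapse them into distinct regions of the protein. The sketch below merges overlapping (start, end) intervals in Python. The coordinate pairs are an assumption: they are read from the table rows by splitting the fused E-value, Start and End fields in the most plausible way (for example, 7.2E-31 followed by 19 and 162). The helper name `merge_regions` is hypothetical.

```python
def merge_regions(regions: list[tuple[int, int]]) -> list[tuple[int, int]]:
    """Merge overlapping 1-based inclusive (start, end) intervals."""
    merged: list[tuple[int, int]] = []
    for start, end in sorted(regions):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# (start, end) pairs as read from the Protein Features table (assumed parsing).
hits = [
    (19, 162), (21, 158), (22, 162), (24, 162), (62, 163), (65, 156),
    (229, 288), (230, 289), (230, 289), (232, 285), (235, 284),
]

print(merge_regions(hits))  # [(19, 163), (229, 289)]
```

Under that reading, the hits fall into two regions: an N-terminal receiver domain (roughly residues 19 to 163) and a Myb-like DNA-binding domain (roughly residues 229 to 289).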
Gene Ontology
GO Term     GO Category         GO Description
GO:0000160  Biological Process  phosphorelay signal transduction system
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0016310  Biological Process  phosphorylation
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0016301  Molecular Function  kinase activity
Sequence
Protein Sequence    Length: 681 aa
MASKRSISAE HSNPINSLLK VTILVVDDDS TSLAIVSAML KEWRYEGIRK IFCLGSLYFN  60
DLSRMLVDIV MMVATVKSPA DALSTLRAKP GIDLVVTDLH MPGMNGIELQ RQINREFRVP  120
VIIMSSDEQE SVMLQSLEEG AVFFIAKPVK PDDLKNVWQY AIAAKKGKSV VIEEIASTEG  180
EAPSAGKVSK DEVRSVASVK DDKNNAKKGT KRKASRKSKD DQEDVTGSAP KKAKVVWTNS  240
LHNKFLEALR QIGLDKAVPK KILEIMNVPG LTRENVASHL QKYRIFLKRV AERGCFASKA  300
FVEKILRSSF ASGHPLLLKT AQEYARLAEL QQKRGLTFRP EYGGYVSYQN AHNAATHGSV  360
LFPYQNASSS NSAQRHACGQ SHLLLGNQAN NKRLVSGNTN PLYQGNRLGF ANGSNFSLNG  420
SLTNATNGLM NGANSRHTYQ QQIQARQNFH SAGFPSQFRF GSSSLHSSNS TLGTGNIGSI  480
STSYPTLNSS CSNNNSYAGV RLTTGGQLIE MGQTRLNGCY GSMDGTYNEE MNVAAMGNQT  540
FGYMGQGGSS SAGLNNGANQ VSPANTAANT SMLPGLDNNG GAKHYKSGHL MNNAPTFDNI  600
TPQQLGDGSL SDLLLESKNY QFPCQQQDGG DGVQSPDFLS SSIFSEIFPS LEELLNSDFS  660
ESLSLEDTAP QNEEALEKAS *
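The sequence block above is formatted for reading (groups of ten residues, a running position counter, and a trailing `*`), so it needs cleaning before it can be used programmatically. The Python sketch below strips that formatting and extracts a region by the 1-based inclusive coordinates used elsewhere in this record. The names `clean_sequence` and `slice_region` are hypothetical helpers, and the sequence text is copied from the block above.

```python
import re

SEQUENCE_BLOCK = """
MASKRSISAE HSNPINSLLK VTILVVDDDS TSLAIVSAML KEWRYEGIRK IFCLGSLYFN  60
DLSRMLVDIV MMVATVKSPA DALSTLRAKP GIDLVVTDLH MPGMNGIELQ RQINREFRVP  120
VIIMSSDEQE SVMLQSLEEG AVFFIAKPVK PDDLKNVWQY AIAAKKGKSV VIEEIASTEG  180
EAPSAGKVSK DEVRSVASVK DDKNNAKKGT KRKASRKSKD DQEDVTGSAP KKAKVVWTNS  240
LHNKFLEALR QIGLDKAVPK KILEIMNVPG LTRENVASHL QKYRIFLKRV AERGCFASKA  300
FVEKILRSSF ASGHPLLLKT AQEYARLAEL QQKRGLTFRP EYGGYVSYQN AHNAATHGSV  360
LFPYQNASSS NSAQRHACGQ SHLLLGNQAN NKRLVSGNTN PLYQGNRLGF ANGSNFSLNG  420
SLTNATNGLM NGANSRHTYQ QQIQARQNFH SAGFPSQFRF GSSSLHSSNS TLGTGNIGSI  480
STSYPTLNSS CSNNNSYAGV RLTTGGQLIE MGQTRLNGCY GSMDGTYNEE MNVAAMGNQT  540
FGYMGQGGSS SAGLNNGANQ VSPANTAANT SMLPGLDNNG GAKHYKSGHL MNNAPTFDNI  600
TPQQLGDGSL SDLLLESKNY QFPCQQQDGG DGVQSPDFLS SSIFSEIFPS LEELLNSDFS  660
ESLSLEDTAP QNEEALEKAS *
"""


def clean_sequence(block: str) -> str:
    """Drop position counters, whitespace, and the trailing '*' marker."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    return sequence[start - 1:end]


protein = clean_sequence(SEQUENCE_BLOCK)
g2_like = slice_region(protein, 232, 285)

print(len(protein))  # 680
print(g2_like)       # KAKVVWTNSLHNKFLEALRQIGLDKAVPKKILEIMNVPGLTRENVASHLQKYRI
```

The cleaned sequence has 680 residues, which suggests that the listed length of 681 aa counts the terminal `*`. The slice for residues 232 to 285 matches the ungapped query row of the G2-like alignment in the Signature Domain section.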
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
1irz_A  1e-21    228          291        1          64       ARR10-B
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_017972600.1  0.0      PREDICTED: putative two-component response regulator ARR21
TrEMBL  A0A061G1K5      0.0      A0A061G1K5_THECC; Two-component system sensor histidine kinase/response regulator, putative
STRING  EOY23087        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM16213             10           14
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G07210.1  1e-61    response regulator 21
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]