PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG044257t1
Common Name TCM_044257
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family ARR-B
Protein Properties Length: 453 aa    MW: 50028.6 Da    pI: 6.5728
Description ARR-B family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG044257t1  genome  CGD     View CDS
Signature Domain
No.  Domain        Score  E-value  Start  End  HMM Start  HMM End
1    G2-like       62.1   1.1e-19  188    242  1          55
           G2-like   1 kprlrWtpeLHerFveaveqLGGsekAtPktilelmkvkgLtlehvkSHLQkYRl 55 
                       k rl+WtpeL ++Fv+av+ L       Pk+il +m+ +gL +++v+SHLQkYR+
  Thecc1EG044257t1 188 KRRLVWTPELDAKFVRAVQTLSKGSMVHPKRILAIMNEPGLNRAKVASHLQKYRM 242
                       68*******************9889999**************************8 PP

2    Response_reg  54.5   6.7e-19  19     128  1          110
                       EEEESSSHHHHHHHHHHHHHTTCEEEEEESSHHHHHHHHHHHH..ESEEEEESSCTTSEHHHHHHHHHHHTTTSEEEEEESTTTHHHHHHHHH CS
      Response_reg   1 vlivdDeplvrellrqalekegyeevaeaddgeealellkekd..pDlillDiempgmdGlellkeireeepklpiivvtahgeeedalealk 91 
                       vl +d + + ++ l ++l k +y +v++++++ eale+l++++  +D +l+D++  +++G++ll+ i  e   lp+i+vt+ g+ e +++ l 
  Thecc1EG044257t1  19 VLAIDAQIFSLQYLSVVLHKCNY-KVKTTTSAAEALEILRANKyeFDTVLVDVDSATIKGFKLLEIIGLEM-YLPVIMVTGDGSLENIVKGLI 109
                       5677778888999**********.***************999989**********************5544.7******************** PP

                       TTESEEEESS--HHHHHHH CS
      Response_reg  92 aGakdflsKpfdpeelvka 110
                        Ga d++ Kp+  +e+ + 
  Thecc1EG044257t1 110 HGAVDYIIKPVGVQEIKNS 128
                       **************99875 PP

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End  InterPro ID  Description
CDD              cd00156             1.61E-15  9      129  No hit       No description
SuperFamily      SSF52172            5.41E-24  16     148  IPR011006    CheY-like superfamily
Gene3D           G3DSA:3.40.50.2300  3.3E-23   16     159  No hit       No description
SMART            SM00448             3.7E-17   17     129  IPR001789    Signal transduction response regulator, receiver domain
PROSITE profile  PS50110             28.634    18     133  IPR001789    Signal transduction response regulator, receiver domain
Pfam             PF00072             1.4E-14   19     127  IPR001789    Signal transduction response regulator, receiver domain
SuperFamily      SSF46689            1.0E-13   185    246  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60    2.0E-21   186    246  IPR009057    Homeodomain-like
TIGRFAMs         TIGR01557           8.5E-18   188    242  IPR006447    Myb domain, plants
Pfam             PF00249             1.1E-5    191    241  IPR001005    SANT/Myb domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0000160  Biological Process  phosphorelay signal transduction system
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 453 aa
MEENSVVEVD DNAVHELRVL AIDAQIFSLQ YLSVVLHKCN YKVKTTTSAA EALEILRANK  60
YEFDTVLVDV DSATIKGFKL LEIIGLEMYL PVIMVTGDGS LENIVKGLIH GAVDYIIKPV  120
GVQEIKNSLW HCVSLNNAYW GSQQSMDSQE SSDHQSLKPL EHSNASITVE DDKHSTPLDD  180
ASSSCQKKRR LVWTPELDAK FVRAVQTLSK GSMVHPKRIL AIMNEPGLNR AKVASHLQKY  240
RMSLKKQQGF DIESVTRYSS TKRNNRRNGK AGGVNADPLA VPSFNPFHSL EDINSIFLDP  300
IGGGNTVMSQ HRIPYGGLLV DPQKPYQSVP YSCLDDPNFQ TPDFKSFNYY NYCLGMNIQP  360
HDLGSEPLPG TTSRSPYFHD VGSEASTPSS APFYPSSNAF LAPETDVAFQ SPFAVASAPN  420
LFPVKRSWTT WQSTNLGNLL LPAARRLLSC GE*
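
The domain coordinates listed under Signature Domain can be checked against the sequence block above. The following is a minimal sketch, not part of the PlantTFDB record: the helper names `parse_sequence_block` and `slice_region` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, as the alignment rows suggest. It strips the spacing, the running position counts, and the trailing `*`, then extracts the G2-like region (188 to 242).

```python
import re

# Sequence block copied from this record: residues grouped in tens,
# with a running position count at the end of each full line.
RAW_BLOCK = """
MEENSVVEVD DNAVHELRVL AIDAQIFSLQ YLSVVLHKCN YKVKTTTSAA EALEILRANK  60
YEFDTVLVDV DSATIKGFKL LEIIGLEMYL PVIMVTGDGS LENIVKGLIH GAVDYIIKPV  120
GVQEIKNSLW HCVSLNNAYW GSQQSMDSQE SSDHQSLKPL EHSNASITVE DDKHSTPLDD  180
ASSSCQKKRR LVWTPELDAK FVRAVQTLSK GSMVHPKRIL AIMNEPGLNR AKVASHLQKY  240
RMSLKKQQGF DIESVTRYSS TKRNNRRNGK AGGVNADPLA VPSFNPFHSL EDINSIFLDP  300
IGGGNTVMSQ HRIPYGGLLV DPQKPYQSVP YSCLDDPNFQ TPDFKSFNYY NYCLGMNIQP  360
HDLGSEPLPG TTSRSPYFHD VGSEASTPSS APFYPSSNAF LAPETDVAFQ SPFAVASAPN  420
LFPVKRSWTT WQSTNLGNLL LPAARRLLSC GE*
"""


def parse_sequence_block(block: str) -> str:
    """Return the residues as one string (hypothetical helper).

    Drops whitespace and position counts, then removes the trailing '*'.
    """
    letters = re.sub(r"[^A-Za-z*]", "", block)
    return letters.rstrip("*").upper()


def slice_region(seq: str, start: int, end: int) -> str:
    """Extract a region given 1-based, inclusive coordinates (assumed convention)."""
    return seq[start - 1:end]


sequence = parse_sequence_block(RAW_BLOCK)
g2_like = slice_region(sequence, 188, 242)

print(len(sequence))  # number of residue letters, excluding the trailing '*'
print(g2_like)        # should match the Thecc1EG044257t1 row of the G2-like alignment
```

The block contains 452 residue letters followed by `*`. The listed length of 453 aa presumably counts that terminal symbol, though the record does not say so.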
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007010439.2  0.0      PREDICTED: two-component response regulator ARR12
TrEMBL  A0A061FWT3      0.0      A0A061FWT3_THECC; Type-b response regulator
STRING  EOY19249        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM2076546
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G31920.1  8e-46    response regulator 10
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]