PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG038943t1
Common Name TCM_038943
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family ARR-B
Protein Properties Length: 506aa    MW: 56395.3 Da    PI: 5.0575
Description ARR-B family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG038943t1  genome  CGD     View CDS
Signature Domain
No.  Domain        Score  E-value  Start  End  HMM Start  HMM End
1    G2-like       66.2   5.8e-21  195    249  1          55
           G2-like   1 kprlrWtpeLHerFveaveqLGGsekAtPktilelmkvkgLtlehvkSHLQkYRl 55 
                       k+rl WtpeL ++Fv+av+ L  s    Pk+il++m+ + Lt+e+++SHLQkYR+
  Thecc1EG038943t1 195 KKRLAWTPELDAKFVKAVQTLSKSSMVHPKRILKIMNEPELTRENIASHLQKYRI 249
                       78*******************9999*****************************7 PP

2    Response_reg  57.1   1e-19    27     137  1          111
                       EEEESSSHHHHHHHHHHHHHTTCEEEEEESSHHHHHHHHHHHH..ESEEEEESSCTTSEHHHHHHHHHHHTTTSEEEEEESTTTHHHHHHHHH CS
      Response_reg   1 vlivdDeplvrellrqalekegyeevaeaddgeealellkekd..pDlillDiempgmdGlellkeireeepklpiivvtahgeeedalealk 91 
                       vl +d + + ++ l+ +l k +y +v++++++ eale+l++k+  ++++l+D++  +++G++ll+ i  e   lp+i+vt+ g+ e + + l 
  Thecc1EG038943t1  27 VLAIDAQIFSLQYLCAVLHKCNY-KVKTTTSAAEALEILRAKKyeFNIVLVDVDSANINGFKLLEIIGLEM-YLPVIMVTGDGSLENIMKGLI 117
                       56677788889999*********.***************998888**********************5544.7******************** PP

                       TTESEEEESS--HHHHHHHH CS
      Response_reg  92 aGakdflsKpfdpeelvkav 111
                       +Ga d++ Kp+  +e+ +++
  Thecc1EG038943t1 118 YGAVDYIIKPVGVQEIKNTI 137
                       ****************9876 PP

Protein Features
Database         Entry ID            E-value   Start  End  InterPro ID  Description
SuperFamily      SSF52172            4.41E-24  24     164  IPR011006    CheY-like superfamily
Gene3D           G3DSA:3.40.50.2300  8.6E-24   24     166  No hit       No description
SMART            SM00448             2.8E-18   25     137  IPR001789    Signal transduction response regulator, receiver domain
PROSITE profile  PS50110             30.637    26     141  IPR001789    Signal transduction response regulator, receiver domain
Pfam             PF00072             1.1E-14   27     137  IPR001789    Signal transduction response regulator, receiver domain
CDD              cd00156             1.41E-16  28     137  No hit       No description
SuperFamily      SSF46689            6.63E-15  192    253  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60    2.6E-22   193    253  IPR009057    Homeodomain-like
TIGRFAMs         TIGR01557           6.2E-20   195    249  IPR006447    Myb domain, plants
Pfam             PF00249             7.4E-7    197    248  IPR001005    SANT/Myb domain
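The hits listed above fall into two groups of overlapping coordinates. A minimal Python sketch of how that grouping could be checked is given here; `merge_spans` and `HITS` are hypothetical names introduced for illustration (they are not part of PlantTFDB), and the coordinates are copied by hand from the table, 1-based and inclusive.

```python
# Hypothetical helper (not a PlantTFDB API): merge overlapping hit
# coordinates into contiguous regions. Coordinates are 1-based, inclusive.
def merge_spans(spans):
    merged = []
    for start, end in sorted(spans):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            # No overlap: open a new region.
            merged.append([start, end])
    return [tuple(region) for region in merged]


# (database, start, end) copied by hand from the Protein Features table.
HITS = [
    ("SuperFamily", 24, 164),
    ("Gene3D", 24, 166),
    ("SMART", 25, 137),
    ("PROSITE profile", 26, 141),
    ("Pfam", 27, 137),
    ("CDD", 28, 137),
    ("SuperFamily", 192, 253),
    ("Gene3D", 193, 253),
    ("TIGRFAMs", 195, 249),
    ("Pfam", 197, 248),
]

regions = merge_spans([(start, end) for _, start, end in HITS])
print(regions)  # [(24, 166), (192, 253)]
```

Under these assumptions the ten hits collapse into two regions: residues 24-166, covering the receiver domain hits, and residues 192-253, covering the Homeodomain-like/Myb hits. These agree with the two signature domains reported for this entry (Response_reg at 27-137 and G2-like at 195-249).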
Gene Ontology
GO Term     GO Category         GO Description
GO:0000160  Biological Process  phosphorelay signal transduction system
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 506 aa
MEESFLNKSS ENSVVEVDDN TVHELQVLAI DAQIFSLQYL CAVLHKCNYK VKTTTSAAEA  60
LEILRAKKYE FNIVLVDVDS ANINGFKLLE IIGLEMYLPV IMVTGDGSLE NIMKGLIYGA  120
VDYIIKPVGV QEIKNTIWHS VTLNKMWDSE QSTDNEETSD HQNLKPLDHS NATITVEDDT  180
DNIRLDDASS SCQKKKRLAW TPELDAKFVK AVQTLSKSSM VHPKRILKIM NEPELTRENI  240
ASHLQKYRIS LKKRKAEMNQ QGFKIKSVTR CSSTRRSNCR NGEAGDVNAD RVAVPSFNPF  300
HSFEDMNSIF LDPSRGGNTV MSQHRIPYVG LLDPEKPYQS VPYSCLDDPN FQTPDFRSFS  360
YYNYCLGMNI QPHNLGSEPL SGTTSRSPYF HDMGSEPSTP SSTPFFPFSN EFFAPETDVA  420
FQSPFAVASV PNLFPGSTGL TRNFGETELA SNGTTKSRED TVSEMDDNQL SDHGLPGSLR  480
ILATYSSQQP EVSCMAESDL AHHAC*
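The domain coordinates reported for this entry (Response_reg at 27-137, G2-like at 195-249) can be checked against the sequence block above. The following Python sketch is one way to do that; `parse_sequence` and `subseq` are hypothetical helper names introduced for illustration, and the sequence text is pasted verbatim from this record.

```python
import re

# Sequence block pasted verbatim from the record (position counters,
# spaces, and the trailing '*' stop symbol included).
RAW = """
MEESFLNKSS ENSVVEVDDN TVHELQVLAI DAQIFSLQYL CAVLHKCNYK VKTTTSAAEA  60
LEILRAKKYE FNIVLVDVDS ANINGFKLLE IIGLEMYLPV IMVTGDGSLE NIMKGLIYGA  120
VDYIIKPVGV QEIKNTIWHS VTLNKMWDSE QSTDNEETSD HQNLKPLDHS NATITVEDDT  180
DNIRLDDASS SCQKKKRLAW TPELDAKFVK AVQTLSKSSM VHPKRILKIM NEPELTRENI  240
ASHLQKYRIS LKKRKAEMNQ QGFKIKSVTR CSSTRRSNCR NGEAGDVNAD RVAVPSFNPF  300
HSFEDMNSIF LDPSRGGNTV MSQHRIPYVG LLDPEKPYQS VPYSCLDDPN FQTPDFRSFS  360
YYNYCLGMNI QPHNLGSEPL SGTTSRSPYF HDMGSEPSTP SSTPFFPFSN EFFAPETDVA  420
FQSPFAVASV PNLFPGSTGL TRNFGETELA SNGTTKSRED TVSEMDDNQL SDHGLPGSLR  480
ILATYSSQQP EVSCMAESDL AHHAC*
"""


def parse_sequence(raw):
    """Keep residue letters only: drops counters, whitespace, and '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def subseq(seq, start, end):
    """Return residues start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]


SEQ = parse_sequence(RAW)
receiver = subseq(SEQ, 27, 137)   # Response_reg domain
g2_like = subseq(SEQ, 195, 249)   # G2-like domain

print(len(receiver), receiver[:10])
print(len(g2_like), g2_like)
```

The G2-like slice reproduces the query line of the domain alignment (`KKRLAWTPEL...ASHLQKYRI`), and the receiver slice starts with `VLAIDAQIFS` and ends with `QEIKNTI`, matching the alignment endpoints. The block contains 505 residue letters plus the terminal `*`, so the stated length of 506 may count the stop symbol.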
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
1irz_A  3e-14   191          253        1          62       ARR10-B
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_021277381.1  0.0      two-component response regulator ORR22-like
TrEMBL  A0A061GXH1      0.0      A0A061GXH1_THECC; Type-b response regulator
STRING  EOY31804        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM2076546
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G25180.1  6e-50    response regulator 12
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]