PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG015991t1
Common Name TCM_015991
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family GRAS
Protein Properties Length: 796aa    MW: 87113.4 Da    PI: 5.9876
Description GRAS family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG015991t1  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    329.1  8.4e-101  434    794  3          374
              GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfsh 95 
                       +lL+++Ae v +g++  aq +Larl+++ sp g+p+qR+a+yf+eAL+  l    ++ ++  p ++ ++ +++ ++ a+k+fsevsP+++f +
  Thecc1EG015991t1 434 DLLYQAAELVGTGNFLHAQGILARLNHQLSPVGKPLQRAAFYFKEALQLLLIM--NNPVSPPPLRSPTPFDVIFKMGAYKVFSEVSPLIQFVN 524
                       79************************************************998..5556677777777888********************** PP

              GRAS  96 ltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfefnvlvakrled 188
                       +t+Nqa+lea++ ++r+Hi+Dfdi+ G+QW++++q+L  R++g+pslRiT+++sp++++  el  ++e+L +fA+e+gv+fe +vl  + l++
  Thecc1EG015991t1 525 FTCNQALLEALDDADRIHIVDFDIGFGAQWASFMQELPMRSRGAPSLRITAFASPSTHHPIELGLMRENLMQFANEIGVNFELEVLNFDCLDQ 617
                       *************************************************************************************97777766 PP

              GRAS 189 l..eleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpres 279
                          +l ++r +++Ea+aVn+ +   +  +++ +l++    +L++vk+lsPk++v  ++  d+n+ +F +++++a++ y  lf+sl+a    +s
  Thecc1EG015991t1 618 TpySLPMFRSNENEAVAVNFPV--WSSSNQPSALPN----LLRFVKQLSPKIMVSLDRGGDRNDLPFPQHIIHAFQSYINLFESLDAV-NVTS 703
                       555677889999********99..444455666666....*********************************************665.579* PP

              GRAS 280 eerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSa 372
                       ++ +k+E++l+ ++i++ v  +  +     e++  Wr  +++aGF+pv++s+++++qa++++++ +++g+rve++++slvl+W +  L+svSa
  Thecc1EG015991t1 704 DAVNKIEKFLFVPRIESTVLGRLHA----PEKMPLWRTLFSSAGFSPVTFSNFTETQAECVVKRAQVRGFRVEKRQASLVLCWLQGDLISVSA 792
                       *******************998887....9*************************************************************** PP

              GRAS 373 Wr 374
                       Wr
  Thecc1EG015991t1 793 WR 794
                       *8 PP

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
PROSITE profile  PS50985   46.749   406    774  IPR005202    Transcription factor GRAS
Pfam             PF03514   2.9E-98  434    794  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
Sequence
Protein Sequence    Length: 796 aa
MHFQLQAKGA VELAGFASIC QQDKWIKQQE ANSFSFANSF YYNNEQEPTS VLHMRRSQSP  60
PTSASTLSSS FNGGAAGAGG GGNSTDNTTT TAATIAPPET SLPNNNKEEW ATELQPIPSE  120
LDLVPGPGGG QRCNLGLEDW ETMLSESAVS PSQDHSFLGW ITGDVNDPSF GLKQLLQSGS  180
TGPNPGLDFE GNAGLGVVDQ GPGFDPIGSL NSSGPGNVIS SAAPNLGGFP GSGFLPNTSN  240
NGNGKIGSVM PSSSSVGVVN NHKVLGASVG LNTNIQNPVF TSPANNIGLP VSLPMLYQQQ  300
QQGQYVESQE EKPQILNAQV LMVQQQHPQN PNFFLPLPQE HHLLQPLPKR LNPGNLELSS  360
QIPKLQFSDA GHELFMRKQQ QQHMGFPHGV QFVPQQKPLM VAKQKVLGPG EEMAQQQQQH  420
QYQLHQQQQT TLFDLLYQAA ELVGTGNFLH AQGILARLNH QLSPVGKPLQ RAAFYFKEAL  480
QLLLIMNNPV SPPPLRSPTP FDVIFKMGAY KVFSEVSPLI QFVNFTCNQA LLEALDDADR  540
IHIVDFDIGF GAQWASFMQE LPMRSRGAPS LRITAFASPS THHPIELGLM RENLMQFANE  600
IGVNFELEVL NFDCLDQTPY SLPMFRSNEN EAVAVNFPVW SSSNQPSALP NLLRFVKQLS  660
PKIMVSLDRG GDRNDLPFPQ HIIHAFQSYI NLFESLDAVN VTSDAVNKIE KFLFVPRIES  720
TVLGRLHAPE KMPLWRTLFS SAGFSPVTFS NFTETQAECV VKRAQVRGFR VEKRQASLVL  780
CWLQGDLISV SAWRC*
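
The sequence block above is printed in groups of ten residues with a running position counter at the end of each full line, and it ends with a `*` symbol. A minimal Python sketch for turning such a block back into a plain residue string follows; the helper name `parse_numbered_block` is hypothetical and not part of PlantTFDB. It assumes that digits and whitespace are never part of the sequence and that a trailing `*` marks the stop position. It is demonstrated on the last two lines of the block only.

```python
import re


def parse_numbered_block(text):
    """Return (residues, has_stop) for a grouped, position-numbered block.

    Hypothetical helper: assumes digits and whitespace are layout only,
    and that a trailing '*' is a stop symbol rather than a residue.
    """
    # Drop spaces, newlines, and the running position counters.
    compact = re.sub(r"[\s\d]+", "", text)
    has_stop = compact.endswith("*")
    return compact.rstrip("*"), has_stop


# Last two lines of the record (the counter on the previous line reads 720).
tail = """
TVLGRLHAPE KMPLWRTLFS SAGFSPVTFS NFTETQAECV VKRAQVRGFR VEKRQASLVL  780
CWLQGDLISV SAWRC*
"""

residues, has_stop = parse_numbered_block(tail)
print(len(residues), has_stop)  # 75 True
```

These two lines hold 75 residues after the position-720 counter, so the block as printed appears to contain 795 amino acids plus the stop symbol. That would account for the stated length of 796 only if the `*` is counted, which the record does not confirm.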
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5hyz_A  1e-34   451          794        22         375      GRAS family transcription factor containing protein, expressed
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_017973382.1  0.0      PREDICTED: scarecrow-like protein 22
TrEMBL  A0A061GBD8      0.0      A0A061GBD8_THECC; GRAS family transcription factor
STRING  EOY24374        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM16534813
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G00150.1  1e-116   GRAS family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]