PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG016186t1
Common Name TCM_016186
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family GRAS
Protein Properties    Length: 672 aa    MW: 74493.6 Da    pI: 5.9132
Description GRAS family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG016186t1  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    425.6  3.9e-130  289    656  1          374
              GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkf 93 
                       lv+lL++c+ea+ s++ +++++++a+l +lasp+g+++ Rl+ay+teALa r++r +++++++ +p+e + +  +++ +al+l+++vsPi+kf
  Thecc1EG016186t1 289 LVHLLTACVEAIGSKNIAAINHFMAKLGDLASPRGSAISRLTAYYTEALALRVTRLWPHIFHITTPRELD-RVDDDNGTALRLLNQVSPIPKF 380
                       68*****************************************************************998.5678888999************ PP

              GRAS  94 shltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfefnvlvakrl 186
                        h+t N+ +l+a+eg++rvHiiDfdi+qGlQWp+L+q+LasR+++p ++R+Tg+g+    sk+el+etg+rLa fAe+l++pfef++ v++rl
  Thecc1EG016186t1 381 VHFTSNEILLRAFEGKDRVHIIDFDIKQGLQWPSLFQSLASRTNPPSHVRVTGIGE----SKQELNETGDRLAGFAEALNLPFEFHP-VVDRL 468
                       ********************************************************....***************************.7**** PP

              GRAS 187 edleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpres 279
                       ed++l++L+vk++E++aVn+v+qlh++l +    +   +++L l++s++P vv+v+eqea+hn  s  +r++++l+yysa+fds++++lp es
  Thecc1EG016186t1 469 EDVRLWMLHVKEKESVAVNCVFQLHKTLYDGNGGA--LRDFLGLLRSTNPAVVIVAEQEAEHNVLSIDARVTNSLRYYSAIFDSMDSSLPLES 559
                       ****************************6554444..589***************************************************** PP

              GRAS 280 eerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveee...sgslvlgWkdrpLvs 369
                         rik+E++ ++rei+n++aceg++r+erhe++ekWr+ +e+ GF+ + +se++  q ++ll++++ + y+v+++    ++l+l W d+pL+s
  Thecc1EG016186t1 560 PIRIKIEEM-FAREIRNIIACEGSDRFERHESFEKWRKLMEQGGFRCMGVSERELLQSQMLLKMYSCENYSVKKQgqdGAALTLSWLDQPLYS 651
                       *********.********************************************************999****6544266777********** PP

              GRAS 370 vSaWr 374
                       vSaW+
  Thecc1EG016186t1 652 VSAWT 656
                       ****6 PP
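
The alignment rows above carry their own sequence coordinates, so they can be cross-checked mechanically. The following is a minimal Python sketch, not part of the PlantTFDB record: it assumes each target row has the layout "identifier, start, aligned residues, end" seen above, that `-` marks a gap in the target, and that lowercase letters are ordinary residues. The names `ROW`, `target_rows`, and `rows_are_consistent` are hypothetical helpers introduced only for illustration.

```python
import re

# One aligned row: identifier, start coordinate, aligned residues, end coordinate.
ROW = re.compile(r"^\s*(\S+)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def target_rows(alignment, target_id):
    """Return (start, end, ungapped residues) for each row labelled target_id."""
    rows = []
    for line in alignment.splitlines():
        m = ROW.match(line)
        if m is None or m.group(1) != target_id:
            continue  # not a coordinate row for the requested identifier
        start, end = int(m.group(2)), int(m.group(4))
        # Assumption: '-' is a gap character; lowercase letters are residues.
        residues = m.group(3).replace("-", "").upper()
        rows.append((start, end, residues))
    return rows


def rows_are_consistent(rows):
    """Check that each row holds exactly end - start + 1 residues."""
    return all(len(res) == end - start + 1 for start, end, res in rows)


# The final alignment block of this record, copied verbatim.
SAMPLE = """\
              GRAS 370 vSaWr 374
                       vSaW+
  Thecc1EG016186t1 652 VSAWT 656
                       ****6 PP
"""

rows = target_rows(SAMPLE, "Thecc1EG016186t1")
print(rows)
print(rows_are_consistent(rows))
```

Joining the residues of all rows in order should reproduce the domain span given in the table (289 to 656), provided the layout assumptions hold for every block.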

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   58.839    263    633  IPR005202    Transcription factor GRAS
Pfam             PF03514   1.4E-127  289    656  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
Sequence
Protein Sequence    Length: 672 aa
MLAGCSSSTL LSPRHRLRSE ASAQFQACHF QTSMSTQRLD LPCSFSRKDT SRSQPIRPVG  60
LSVEKPIESK TTGCSLKQNI RLPPLTTTAQ NPFEGRREIK DEFWEKGKSL KRFAEQGLVD  120
ESVINRAKRK KGSSDNEDSG DIHEGGGDNL SLGQLGAGNF WFQPSFTGQN APQVPFSLTC  180
SGDEERVCFV PSEVISPPLP LSNNPWIESV ITEITDVGEK DVETIHRPAN ETSGSSTSSE  240
SHSLGLRLNE QATEQEVGNG SGNPYPHDGA RLGANAEENN HGEHQGFELV HLLTACVEAI  300
GSKNIAAINH FMAKLGDLAS PRGSAISRLT AYYTEALALR VTRLWPHIFH ITTPRELDRV  360
DDDNGTALRL LNQVSPIPKF VHFTSNEILL RAFEGKDRVH IIDFDIKQGL QWPSLFQSLA  420
SRTNPPSHVR VTGIGESKQE LNETGDRLAG FAEALNLPFE FHPVVDRLED VRLWMLHVKE  480
KESVAVNCVF QLHKTLYDGN GGALRDFLGL LRSTNPAVVI VAEQEAEHNV LSIDARVTNS  540
LRYYSAIFDS MDSSLPLESP IRIKIEEMFA REIRNIIACE GSDRFERHES FEKWRKLMEQ  600
GGFRCMGVSE RELLQSQMLL KMYSCENYSV KKQGQDGAAL TLSWLDQPLY SVSAWTPVDV  660
AGSSSSFSQP S*
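
The listing above is formatted in blocks of ten residues with a running position counter at the end of each row. The sketch below is an illustration rather than a PlantTFDB tool: it shows one way to turn such a listing into a plain string and to slice it with 1-based, inclusive coordinates like those used in the domain tables. The function names `parse_numbered_blocks` and `slice_1based` are hypothetical, and the sample holds only the first two rows of the sequence.

```python
import re


def parse_numbered_blocks(text):
    """Join a block-formatted listing into one uppercase residue string."""
    # Keeping only letters drops the block spacing, the trailing position
    # counters, and the terminal '*' in a single pass.
    return re.sub(r"[^A-Za-z]", "", text).upper()


def slice_1based(seq, start, end):
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


# First two rows of the protein sequence, copied verbatim.
SAMPLE = """\
MLAGCSSSTL LSPRHRLRSE ASAQFQACHF QTSMSTQRLD LPCSFSRKDT SRSQPIRPVG  60
LSVEKPIESK TTGCSLKQNI RLPPLTTTAQ NPFEGRREIK DEFWEKGKSL KRFAEQGLVD  120
"""

seq = parse_numbered_blocks(SAMPLE)
print(len(seq))                   # 120
print(slice_1based(seq, 61, 70))  # LSVEKPIESK
```

Applied to the full listing, `slice_1based(seq, 289, 656)` should return the region reported for the GRAS domain.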
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  2e-73    296          657        26         380      Protein SCARECROW
5b3h_A  2e-73    296          657        25         379      Protein SCARECROW
5b3h_D  2e-73    296          657        25         379      Protein SCARECROW
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_017972132.1  0.0      PREDICTED: scarecrow-like protein 28
Swissprot  Q9CAN3          0.0      SCL28_ARATH; Scarecrow-like protein 28
TrEMBL     A0A061G628      0.0      A0A061G628_THECC; GRAS family transcription factor
STRING     EOY24632        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM7296              27           43
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G63100.1  0.0      GRAS family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]