PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG041814t1
Common Name TCM_041814
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family GRAS
Protein Properties Length: 791 aa    MW: 89125.2 Da    pI: 4.925
Description GRAS family protein
Gene Model
Gene Model ID | Type | Source | Coding Sequence
Thecc1EG041814t1 | genome | CGD | View CDS
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | GRAS | 378.3 | 9.4e-116 | 411 | 784 | 1 | 373
              GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkf 93 
                       l++lL+ cA+avs++d + a +lL++++e++sp gd +qRla++f+++L+arl +s++ + +   +s +s+++ ++ l+a++++  ++P+ k+
  Thecc1EG041814t1 411 LRTLLILCAQAVSADDRRTAGELLKQIKEHSSPLGDGTQRLAHFFANGLEARLDGSGTAIQNLY-SSLASKTTAADMLKAYQVYLCACPFKKL 502
                       5789***************************************************888887655.555556889******************* PP

              GRAS  94 shltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvak 184
                       s + aN++I   +e+++++Hi+Df+i +G+QWp L+q L++Rp+gpp+lRiTg++ p+ g   +e++eetg+rL++++++++vpfe+n+++a+
  Thecc1EG041814t1 503 SIFFANKMIWHMAEKASALHIVDFGILYGFQWPILIQHLSKRPGGPPKLRITGIEIPQRGfrPAERIEETGRRLERYCKRFDVPFEYNPMAAQ 595
                       **********************************************************99******************************9** PP

              GRAS 185 rledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpr 277
                       ++e++++e++++k++E+laVn+ ++ ++llde+++++ +r++vLkl+++++P+++v++  + ++n++ Fl+rf eal + sa+fd++e++lpr
  Thecc1EG041814t1 596 NWETIQVEDIKIKSNEMLAVNCLFRFKNLLDETAEVDCPRNAVLKLIRKMNPDIFVHSIDNGSYNAPFFLTRFREALFHLSAMFDMFENTLPR 688
                       ********************************************************************************************* PP

              GRAS 278 eseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsv 370
                       e+ +r   Ere+ gre++nvvaceg+er+er et+++W+ r  +aGFkp+pl+++ +k++++ l+  + + + ++e+++++++gWk+r L++ 
  Thecc1EG041814t1 689 EEPARLLFEREFYGREAMNVVACEGSERVERPETYKQWQVRTIRAGFKPLPLNQELMKTVRAKLKSWYHKDFVIDEDNHWMLQGWKGRILYAS 781
                       ********************************************************************888********************** PP

              GRAS 371 SaW 373
                       ++W
  Thecc1EG041814t1 782 TCW 784
                       *** PP

Protein Features
Database | Entry ID | E-value | Start | End | InterPro ID | Description
PROSITE profile | PS50985 | 68.037 | 385 | 765 | IPR005202 | Transcription factor GRAS
Pfam | PF03514 | 3.2E-113 | 411 | 784 | IPR005202 | Transcription factor GRAS
Gene Ontology
GO Term | GO Category | GO Description
GO:0006355 | Biological Process | regulation of transcription, DNA-templated
Sequence
Protein Sequence    Length: 791 aa
MVMDPKFTEF TDYINGFGVE DDALLFTSGQ YPNFTNGLEF NVSSPDLGFM SANVPVIPPN  60
PDPGISVPPA TVSSDGSSFS ASTGWSPDGE SSSPSDDSDS TDPVLKYIRQ MLMEENMEDK  120
PFMFNDYLAL EDTEKSLYEV LGEQYPPSNQ PQPFLNVNVE SPDSNLSGNS RDNGSNSNST  180
TSISTSNGTS NYIDHWGVGE VVEHAPSLLQ APLSGDYHFQ SNLQQPSSQF SVNSTNSSSN  240
MGNGLMESSL SELLVQNIFS DKESVLQFQR GFEEASKFLP SSNQLIIDLE SNKFPMVQKG  300
KVPNLVVKVE KDERENSPDE LRGRKNHERD DGGLEEERSN KQSAVYTEES DLSDMFDKVL  360
LCTDGKAMCG YNKALQQGET KTLLQKEQSN ESSVGKTRSK KQEKKKETVD LRTLLILCAQ  420
AVSADDRRTA GELLKQIKEH SSPLGDGTQR LAHFFANGLE ARLDGSGTAI QNLYSSLASK  480
TTAADMLKAY QVYLCACPFK KLSIFFANKM IWHMAEKASA LHIVDFGILY GFQWPILIQH  540
LSKRPGGPPK LRITGIEIPQ RGFRPAERIE ETGRRLERYC KRFDVPFEYN PMAAQNWETI  600
QVEDIKIKSN EMLAVNCLFR FKNLLDETAE VDCPRNAVLK LIRKMNPDIF VHSIDNGSYN  660
APFFLTRFRE ALFHLSAMFD MFENTLPREE PARLLFEREF YGREAMNVVA CEGSERVERP  720
ETYKQWQVRT IRAGFKPLPL NQELMKTVRA KLKSWYHKDF VIDEDNHWML QGWKGRILYA  780
STCWIPAQES *
3D Structure
Structure
PDB ID | Evalue | Query Start | Query End | Hit Start | Hit End | Description
5b3g_A | 8e-49 | 396 | 786 | 4 | 380 | Protein SCARECROW
5b3h_A | 8e-49 | 396 | 786 | 3 | 379 | Protein SCARECROW
5b3h_D | 8e-49 | 396 | 786 | 3 | 379 | Protein SCARECROW
Functional Description
Source | Description
UniProt | Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source | Upstream Regulator | Target Gene
PlantRegMap | Retrieve | -
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_021278110.1 | 0.0 | scarecrow-like protein 14
Refseq | XP_021278112.1 | 0.0 | scarecrow-like protein 14
Refseq | XP_021278113.1 | 0.0 | scarecrow-like protein 14
Refseq | XP_021278114.1 | 0.0 | scarecrow-like protein 14
Swissprot | Q9XE58 | 0.0 | SCL14_ARATH; Scarecrow-like protein 14
TrEMBL | A0A061H016 | 0.0 | A0A061H016_THECC; GRAS family transcription factor isoform 1
STRING | EOY33999 | 0.0 | (Theobroma cacao)
Orthologous Group
Lineage | Orthologous Group ID | Taxa Number | Gene Number
Malvids | OGEM35827189
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT1G07530.1 | 0.0 | SCARECROW-like 14
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]