PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cc01_g04300
Common Name GSCOC_T00039314001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Gentianales; Rubiaceae; Ixoroideae; Coffeeae; Coffea
Family GRAS
Protein Properties Length: 579aa    MW: 65973.8 Da    PI: 7.8034
Description GRAS family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
Cc01_g04300    genome  CGSC    View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    353    4.5e-108  196    559  1          367
         GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshlta 98 
                  +++lL++cA+ +s+ d++ a++lL+++++++sp+gd ++Rla+yf+ AL+ar++++++++y a++++ ++    +++l+a++++  +sP+ k+s++++
  Cc01_g04300 196 MRSLLTQCAQSISDFDNRTANQLLNQIRQHSSPHGDGNERLAHYFADALEARVSGMGTTMYTAFSTRVSA----ADTLKAYQAYILASPFRKMSNILT 289
                  5799*********************************************************999999986....99********************** PP

         GRAS  99 NqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledleleeL 194
                   ++I + + +++++HiiDf+i +G++Wp+++qaL++Rp+gpp+l iTg++ p++g   +e++eetg+rLa+++++++vpfe+++ +a+++e+++leeL
  Cc01_g04300 290 GKTIQKLTTEASQIHIIDFGILYGFHWPCFIQALSKRPRGPPKLCITGIDLPQPGlrPAERVEETGRRLAHYCKKFNVPFEYHA-IARKWETISLEEL 386
                  ************************************************************************************.7************ PP

         GRAS 195 rvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgr 292
                  ++++ E+++  + ++l +++de+  ++s+rd+vL+l+k+++P+++v+   +  +n++ F+ rf eal+++s+lfd+++++lpr++++r + E+e+lgr
  Cc01_g04300 387 KIDRSETVIATCLYRLRNVPDETSVTSSARDTVLHLIKKINPDLFVHGILNGTYNAPFFVMRFREALHHFSSLFDMFDKTLPRHDQDRLVFEKEVLGR 484
                  ************************************************************************************************** PP

         GRAS 293 eivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpL 367
                  + +nv+aceg++r+er et+++W++r e+aGF+++pl+++++k++++ ++  + + + v+e+ +++++gWk+r L
  Cc01_g04300 485 QSMNVIACEGTARIERPETYKQWQARNERAGFRQIPLNKDIVKEVRAKVKLQYHKDFLVDEDGKWILQGWKGRIL 559
                  *****************************************************888****************987 PP
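The coordinates in the Signature Domain table can be sanity-checked against the protein length. The following is a minimal sketch, assuming all coordinates are 1-based and inclusive (as the alignment block above suggests). The helper names `span_length` and `coverage` are hypothetical and are not part of PlantTFDB or HMMER.

```python
def span_length(start: int, end: int) -> int:
    """Number of positions in a 1-based, inclusive interval."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return end - start + 1


def coverage(start: int, end: int, total: int) -> float:
    """Fraction of a sequence of length `total` covered by start..end."""
    if end > total:
        raise ValueError("interval extends past the end of the sequence")
    return span_length(start, end) / total


# Values copied from this record.
PROTEIN_LENGTH = 579
DOMAIN_START, DOMAIN_END = 196, 559
HMM_START, HMM_END = 1, 367

domain_residues = span_length(DOMAIN_START, DOMAIN_END)
hmm_positions = span_length(HMM_START, HMM_END)
fraction = coverage(DOMAIN_START, DOMAIN_END, PROTEIN_LENGTH)

print(f"GRAS domain spans {domain_residues} residues")
print(f"Aligned HMM positions: {hmm_positions}")
print(f"Fraction of protein covered by the domain: {fraction:.3f}")
```

The query span and the HMM span differ slightly in length because the alignment contains gaps on both sides.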

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   61.697    170    546  IPR005202    Transcription factor GRAS
Pfam             PF03514   1.6E-105  196    559  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
Sequence
Protein Sequence    Length: 579 aa
MAGFEYDFPS ITSSGPLNLS NFPVDALPGE HDGDSSLPGD TFGAVLQYLN QMLMGEDLEQ  60
RPCMYQELSA LQAAEKSFYD ALTGVEIIKV ADEKEKERDK FTNGSAKKRN HYRQDDEHVE  120
AGRATKQFAS YAEEPTEIFD EALLCSSSNA GSWDLSYCEA KPGRRNNQQI GLTQEPKRGR  180
PRASERKLNI REVVDMRSLL TQCAQSISDF DNRTANQLLN QIRQHSSPHG DGNERLAHYF  240
ADALEARVSG MGTTMYTAFS TRVSAADTLK AYQAYILASP FRKMSNILTG KTIQKLTTEA  300
SQIHIIDFGI LYGFHWPCFI QALSKRPRGP PKLCITGIDL PQPGLRPAER VEETGRRLAH  360
YCKKFNVPFE YHAIARKWET ISLEELKIDR SETVIATCLY RLRNVPDETS VTSSARDTVL  420
HLIKKINPDL FVHGILNGTY NAPFFVMRFR EALHHFSSLF DMFDKTLPRH DQDRLVFEKE  480
VLGRQSMNVI ACEGTARIER PETYKQWQAR NERAGFRQIP LNKDIVKEVR AKVKLQYHKD  540
FLVDEDGKWI LQGWKGRILQ GWKGIVFDAL SRWNFVQE*
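To work with the sequence outside this page, the numbered block above needs its position counters, spaces, and trailing `*` removed. The following is a minimal sketch, assuming the block layout shown here (groups of ten residues with a running position count at the end of each line). The function names `parse_sequence_block` and `subsequence` are hypothetical, and only the first two lines of the sequence are used as sample input.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Drop position counters, whitespace, and a trailing '*' from a numbered block."""
    # Keep letters only; digits, spaces, newlines, and '*' are discarded.
    return re.sub(r"[^A-Za-z]", "", text).upper()


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates out of range")
    return seq[start - 1:end]


# First two lines of the protein sequence from this record.
block = """
MAGFEYDFPS ITSSGPLNLS NFPVDALPGE HDGDSSLPGD TFGAVLQYLN QMLMGEDLEQ  60
RPCMYQELSA LQAAEKSFYD ALTGVEIIKV ADEKEKERDK FTNGSAKKRN HYRQDDEHVE  120
"""

seq = parse_sequence_block(block)
print(len(seq))
print(subsequence(seq, 1, 10))
```

With the full sequence as input, `subsequence(seq, 196, 559)` would return the region reported for the GRAS domain in the Signature Domain table.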
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  1e-43   202          555        25         368      Protein SCARECROW
5b3h_A  1e-43   202          555        24         367      Protein SCARECROW
5b3h_D  1e-43   202          555        24         367      Protein SCARECROW
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID            E-value  Description
Refseq     XP_027096365.1    0.0      scarecrow-like protein 30 isoform X1
Swissprot  Q9XE58            1e-163   SCL14_ARATH; Scarecrow-like protein 14
TrEMBL     A0A068V2I6        0.0      A0A068V2I6_COFCA; Uncharacterized protein
STRING     Migut.D02076.1.p  0.0      (Erythranthe guttata)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA2762              24           191
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G46600.1  1e-165   GRAS family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]