PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cc07_g10300
Common NameGSCOC_T00037135001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Gentianales; Rubiaceae; Ixoroideae; Coffeeae; Coffea
Family GRAS
Protein Properties Length: 431aa    MW: 48274 Da    PI: 4.7521
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cc07_g10300genomeCGSCView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS288.91.4e-88274232373
         GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykal.ppsetseknsseelaalkl..fsevsPilkfshl 96 
                  ++lLl+cA a++s+d +laq++++ l++ asp gdp+qRl++ f++AL +r++r +++ ++   ++s+++     + +++++l  ++++ P+++f++ 
  Cc07_g10300  27 EKLLLHCASALESNDVTLAQQVMWVLNNVASPVGDPNQRLTSWFLKALISRVSRVCPTPMNLNgNSSPQR-----RLMTVTELagYVDLIPWHRFGFC 119
                  689***************************************************8877776552333333.....22223333449************ PP

         GRAS  97 taNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg...skeeleetgerLakfAeelgvpfefnvl.......... 181
                  + N+aI +a++g ++vHi+D++i++++QWp+L+++La+Rpegpps+Ri + +   ++    + + ee+g+rL +fA+  +vpfefnv+          
  Cc07_g10300 120 ASNSAIFKAIQGYNKVHILDLSITHCMQWPTLIDTLAKRPEGPPSVRISVPSWRPPVpplLNVSSEEVGQRLGNFAKFKDVPFEFNVIgeqptvpssp 217
                  *************************************************988866668899999****************************999888 PP

         GRAS 182 ......vakrledl............eleeLrvkpgEalaVnlvlqlhrlldesvsles....erdevLklvkslsPkvvvvveqeadhnsesFlerf 257
                                            ++++L+++ +Eal++n++  l +l+de+ + +s    +rd +L+ +k+l+P +++vv+++++++ +++  r+
  Cc07_g10300 218 cistleC------SslqydfllnqvlNPSTLKLEDDEALVINCQNWLRYLPDEQNNGASqeisCRDIFLNRIKDLNPCIITVVDEDSELDVSNLTSRI 309
                  5544430......2223334455555888889999****************9988766656779********************************** PP

         GRAS 258 lealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveees 355
                  + +++y +  fd+le+ l++++++ri +E+  +g +i+n++a eg++r+er e+  k ++r+++ GF +vp+ e++ +++k ll +++  g+ +++e+
  Cc07_g10300 310 TTCFNYLWIPFDALETFLSKDNSQRIEYEAD-IGHKIENIIAFEGTQRIERLESGTKFSQRMRNNGFFSVPFCEETISEVKFLLDEHA-SGWGMKKED 405
                  *******************************.********************************************************.8999***** PP

         GRAS 356 gslvlgWkdrpLvsvSaW 373
                  + lvl+Wk+++ v ++aW
  Cc07_g10300 406 DMLVLTWKGHNSVYATAW 423
                  ****************** PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098543.681405IPR005202Transcription factor GRAS
PfamPF035144.9E-8627423IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
Sequence ? help Back to Top
Protein Sequence    Length: 431 aa     Download sequence    Send to blast
MRESPQSSIL SGALKGCLGS LDGACIEKLL LHCASALESN DVTLAQQVMW VLNNVASPVG  60
DPNQRLTSWF LKALISRVSR VCPTPMNLNG NSSPQRRLMT VTELAGYVDL IPWHRFGFCA  120
SNSAIFKAIQ GYNKVHILDL SITHCMQWPT LIDTLAKRPE GPPSVRISVP SWRPPVPPLL  180
NVSSEEVGQR LGNFAKFKDV PFEFNVIGEQ PTVPSSPCIS TLECSSLQYD FLLNQVLNPS  240
TLKLEDDEAL VINCQNWLRY LPDEQNNGAS QEISCRDIFL NRIKDLNPCI ITVVDEDSEL  300
DVSNLTSRIT TCFNYLWIPF DALETFLSKD NSQRIEYEAD IGHKIENIIA FEGTQRIERL  360
ESGTKFSQRM RNNGFFSVPF CEETISEVKF LLDEHASGWG MKKEDDMLVL TWKGHNSVYA  420
TAWVPCGFDD *
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_B4e-532042581474Protein SHORT-ROOT
5b3h_B1e-532042527420Protein SHORT-ROOT
5b3h_E1e-532042527420Protein SHORT-ROOT
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_027063838.10.0scarecrow-like protein 32 isoform X2
RefseqXP_027172564.10.0scarecrow-like protein 32 isoform X2
TrEMBLA0A068U1910.0A0A068U191_COFCA; Uncharacterized protein
STRINGEOX992080.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA103032227
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G49950.11e-124GRAS family protein