PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cc01_g04310
Common Name GSCOC_T00039317001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Gentianales; Rubiaceae; Ixoroideae; Coffeeae; Coffea
Family GRAS
Protein Properties Length: 507 aa    MW: 57483.3 Da    pI: 5.8662
Description GRAS family protein
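The molecular weight listed under Protein Properties can be approximated from the protein sequence. The sketch below is a minimal illustration, assuming standard average residue masses and an unmodified peptide chain. The function name `molecular_weight` and the mass table are illustrative and not part of PlantTFDB, and the database's own calculation method is not stated, so small differences from the listed value are possible.

```python
# Average residue masses in daltons. These are standard reference values,
# not values taken from PlantTFDB (assumption for this sketch).
AVERAGE_RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER_MASS = 18.01528  # one water is added for the free N- and C-termini


def molecular_weight(sequence: str) -> float:
    """Return the average molecular weight (Da) of an unmodified peptide.

    A trailing '*' (stop symbol) is ignored. An empty sequence gives 0.0.
    Unknown residue letters raise KeyError.
    """
    residues = sequence.upper().rstrip("*")
    if not residues:
        return 0.0
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in residues) + WATER_MASS


if __name__ == "__main__":
    # First ten residues of the record's protein sequence.
    print(round(molecular_weight("MDTLFHGDGQ"), 1))
```

Passing the full protein sequence from the Sequence section (with spaces and position numbers removed) should give a value close to the listed MW, provided the assumptions above hold.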
Gene Model
Gene Model ID    Type      Source    Coding Sequence
Cc01_g04310      genome    CGSC      View CDS
Signature Domain
Signature Domain
No.    Domain    Score    E-value     Start    End    HMM Start    HMM End
1      GRAS      355      1.1e-108    146      502    1            374
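The domain row reads as one GRAS hit with score 355, E-value 1.1e-108, protein positions 146 to 502, and HMM positions 1 to 374. The sketch below shows one way to hold such a row and derive the covered lengths. It assumes the coordinates are 1-based and inclusive, which agrees with the alignment that follows; the `DomainHit` class and its field names are hypothetical and not a PlantTFDB API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One row of the signature domain table (hypothetical container)."""
    domain: str
    score: float
    evalue: float
    start: int       # first protein residue in the hit, 1-based inclusive
    end: int         # last protein residue in the hit, 1-based inclusive
    hmm_start: int   # first HMM position matched
    hmm_end: int     # last HMM position matched

    @property
    def protein_span(self) -> int:
        """Number of protein residues covered by the hit."""
        return self.end - self.start + 1

    @property
    def hmm_span(self) -> int:
        """Number of HMM positions covered by the hit."""
        return self.hmm_end - self.hmm_start + 1


gras = DomainHit("GRAS", 355.0, 1.1e-108, 146, 502, 1, 374)
print(gras.protein_span, gras.hmm_span)  # 357 374
```

The protein span (357 residues) is shorter than the HMM span (374 positions) because the alignment contains gaps on the protein side.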
         GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshlta 98 
                  l++lL++cA+av+  d+  a++lL+++++++sp+gdp++Rla +f++AL+arla+++++ly+al+++++s    ++ l a+ ++ e++P+ ++s++ a
  Cc01_g04310 146 LRDLLTRCAHAVAIYDNWTANELLKQIRQHSSPYGDPTERLAYCFANALEARLAGMGTTLYSALTTTRAS---AADILRAYGAYLEICPFQRMSNIFA 240
                  6899************************************************************999998...9************************ PP

         GRAS  99 NqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledleleeL 194
                  N+ I + +++ +r+HiiDf+i +G+QWp+L++ ++ Rp+gpp+lRiTgv+ p++g   +e++eetg rLa++A++++vpfefn+ vakr+++++ e+L
  Cc01_g04310 241 NKSIAKQTSTVTRIHIIDFGILYGFQWPCLIHGISLRPGGPPKLRITGVDLPQPGfrPAERIEETGGRLANYARRFNVPFEFNA-VAKRWDTITAEDL 337
                  ******************************************************9*****************************.8************ PP

         GRAS 195 rvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgr 292
                  +++++E+++Vn+                 rd+v++l+k+++P+++v+   +  ++++ F++rf eal+++s+lfd++ea+lpre+++r++ E+e++gr
  Cc01_g04310 338 AIDEDEMVVVNCLP---------------RDSVMNLIKKINPEMFVHGVLNGTYGAPFFVTRFKEALYHFSSLFDMFEATLPREDQNRSMFEKEVIGR 420
                  ************87...............688****************************************************************** PP

         GRAS 293 eivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                  e++nv+aceg+er+er  ++++W+ r ++aGFk++pl+++++ ++++ ++ ++++ + ++ + +++++gWk+r ++++S+W+
  Cc01_g04310 421 EVMNVIACEGTERIERPDSYKQWQVRNQRAGFKQLPLNSEIMREIRAKVKSYYNKDFLIDADGAWMLQGWKGRVIYALSCWK 502
                  *****************************************************999*************************8 PP

Protein Features
Database           Entry ID    E-value     Start    End    InterPro ID    Description
PROSITE profile    PS50985     60.251      120      482    IPR005202      Transcription factor GRAS
Pfam               PF03514     3.9E-106    146      502    IPR005202      Transcription factor GRAS
Gene Ontology
GO Term       GO Category           GO Description
GO:0006355    Biological Process    regulation of transcription, DNA-templated
Sequence
Protein Sequence    Length: 507 aa
MDTLFHGDGQ SIKSANEFPQ LDFDQQHSLS QGNLVNECQF DHLFNNVTGL SKLSSHLDVN  60
STPCEVGDDS PVEGDYFDGV FKYLQRMLME EDDLLDKPCM LQDCLALQAA EKSFYEVLNE  120
NQPSSPSRGR PRAGQKRDSI RELVDLRDLL TRCAHAVAIY DNWTANELLK QIRQHSSPYG  180
DPTERLAYCF ANALEARLAG MGTTLYSALT TTRASAADIL RAYGAYLEIC PFQRMSNIFA  240
NKSIAKQTST VTRIHIIDFG ILYGFQWPCL IHGISLRPGG PPKLRITGVD LPQPGFRPAE  300
RIEETGGRLA NYARRFNVPF EFNAVAKRWD TITAEDLAID EDEMVVVNCL PRDSVMNLIK  360
KINPEMFVHG VLNGTYGAPF FVTRFKEALY HFSSLFDMFE ATLPREDQNR SMFEKEVIGR  420
EVMNVIACEG TERIERPDSY KQWQVRNQRA GFKQLPLNSE IMREIRAKVK SYYNKDFLID  480
ADGAWMLQGW KGRVIYALSC WKPAGL*
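The sequence block above is grouped in tens with running position numbers, so it needs cleaning before the domain coordinates can be applied to it. The sketch below is a minimal illustration using only the first three lines of the record. The helper names `parse_sequence_block` and `extract_region` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that matches the GRAS alignment start at position 146.

```python
import re

# First three lines of the protein sequence as displayed in this record.
SEQUENCE_EXCERPT = """\
MDTLFHGDGQ SIKSANEFPQ LDFDQQHSLS QGNLVNECQF DHLFNNVTGL SKLSSHLDVN  60
STPCEVGDDS PVEGDYFDGV FKYLQRMLME EDDLLDKPCM LQDCLALQAA EKSFYEVLNE  120
NQPSSPSRGR PRAGQKRDSI RELVDLRDLL TRCAHAVAIY DNWTANELLK QIRQHSSPYG  180
"""


def parse_sequence_block(block: str) -> str:
    """Drop spaces, position numbers, and the stop symbol; keep letters only."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    if start < 1 or end > len(sequence) or start > end:
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


sequence = parse_sequence_block(SEQUENCE_EXCERPT)
print(len(sequence))                       # 180
print(extract_region(sequence, 146, 160))  # LRDLLTRCAHAVAIY
```

The extracted residues match the first fifteen residues of the Cc01_g04310 row in the GRAS alignment, which is a quick check that the coordinates were interpreted correctly. With the full sequence pasted in, `extract_region(sequence, 146, 502)` would return the whole domain region.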
3D Structure
Structure
PDB ID    Evalue    Query Start    Query End    Hit Start    Hit End    Description
5b3g_A    4e-49     152            503          25           380        Protein SCARECROW
5b3h_A    4e-49     152            503          24           379        Protein SCARECROW
5b3h_D    4e-49     152            503          24           379        Protein SCARECROW
Functional Description
Source     Description
UniProt    Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Protein
Source       Hit ID                  E-value    Description
Refseq       XP_027096660.1          0.0        scarecrow-like protein 33
Swissprot    Q9XE58                  1e-162     SCL14_ARATH; Scarecrow-like protein 14
TrEMBL       A0A068V0S2              0.0        A0A068V0S2_COFCA; Uncharacterized protein
STRING       PGSC0003DMT400078286    0.0        (Solanum tuberosum)
Orthologous Group
Lineage     Orthologous Group ID    Taxa Number    Gene Number
Asterids    OGEA2762                24             191
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT1G07530.1    1e-164     SCARECROW-like 14
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]