PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cc08_g14050
Common Name GSCOC_T00030270001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Gentianales; Rubiaceae; Ixoroideae; Coffeeae; Coffea
Family GRAS
Protein Properties Length: 685 aa    MW: 76446.2 Da    pI: 6.0092
Description GRAS family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
Cc08_g14050    genome  CGSC    View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    432.7  2.7e-132  301    667  1          373
         GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshlta 98 
                  lv+lL++cA+a++s++  ++++++arl elasp+g+p+ Rl+ayfteAL+ r+ar +++++++ pp++ + +  ++  ++l+l+++vsPi++f h+t 
  Cc08_g14050 301 LVSLLVACADAIASKNILAINHFIARLGELASPRGSPISRLTAYFTEALSLRVARFWPHIFHISPPRDLD-RVDDDCGTSLRLLNQVSPIPRFIHFTS 397
                  689***********************************************************99999997.5667778899***************** PP

         GRAS  99 NqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfefnvlvakrledleleeLrv 196
                  N+ +l+a+eg++rvHiiDfdi+qGlQWp+L+q+LasR+++p ++RiTg+g+    sk+el etg+rLa fAe+l++pfef++ v++rled++l++L+v
  Cc08_g14050 398 NEILLRAFEGKDRVHIIDFDIKQGLQWPSLFQSLASRTNPPSHIRITGIGE----SKQELIETGDRLAGFAEALNLPFEFHP-VVDRLEDVRLWMLHV 490
                  ***************************************************....***************************.7************** PP

         GRAS 197 kpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgrei 294
                  k++E++aVn+++qlh++l +s+      +++L l++s++P+vv + eqea+hn++ F +r++++l+yysa+fdsle++++++s  r+k+E++ ++rei
  Cc08_g14050 491 KEKESVAVNCIFQLHKMLYDSTGGVL--RDFLGLIRSTNPTVVLMGEQEAEHNGPGFETRLTNSLKYYSAIFDSLEESIAADSPIRMKIEEM-FAREI 585
                  ******************77766655..89**************************************************************.***** PP

         GRAS 295 vnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrve...eesgslvlgWkdrpLvsvSaW 373
                  +n++aceg+er+erh+ + kW++++e+ GF++++ +e++  q ++ll++++s+ y+ve   e+ ++l+l W d+pL+++SaW
  Cc08_g14050 586 RNIIACEGQERFERHQGFVKWQKMMEQGGFRSLNTGERELLQSRMLLKMYGSENYKVEkqgEDGAALTLSWLDQPLYTISAW 667
                  ***************************************************999****6655577788************** PP
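
The match line between the GRAS consensus and the Cc08_g14050 sequence can be summarized programmatically. The sketch below assumes the middle line follows the usual HMMER convention (a letter marks a residue identical to the consensus, `+` marks a conservative substitution, a space marks no match); the function name `midline_summary` is hypothetical and is not part of PlantTFDB or HMMER.

```python
def midline_summary(midline: str) -> dict:
    """Count identical, similar, and unmatched columns in a match line.

    Assumes HMMER-style notation: letter = identical to consensus,
    '+' = conservative substitution, anything else = no match.
    """
    identical = sum(ch.isalpha() for ch in midline)
    similar = midline.count("+")
    other = len(midline) - identical - similar
    return {"identical": identical, "similar": similar, "other": other}


# Final 16 columns of the match line in the last alignment block above.
excerpt = "l+l W d+pL+++SaW"
summary = midline_summary(excerpt)
print(summary)  # {'identical': 9, 'similar': 5, 'other': 2}
```

Applying the same function to the full match lines of all four blocks would give an overall identity estimate for the domain, provided the lines are copied with their internal spacing intact.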

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   60.396    275    645  IPR005202    Transcription factor GRAS
Pfam             PF03514   9.5E-130  301    667  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
Sequence
Protein Sequence    Length: 685 aa
MLAGCSSTLL SPRHRLRSEP PAQFQACHFP SMSTQRLDLP CSFTRKDTAR NQTIRPVGLS  60
VDKPIEAKTS SCSLKQNIRL PPSATTAQTG AYIEGSRKEN RDEFWEKNKS LKRYAEQGPF  120
AGDDDESCMN RAKRKRGNGK SQDFPEEEKK LTLGQLGSGS FWLQQPPFDV PRSVPLIAGL  180
SSPQIPLSLS YSGDEDRVCF VPSDVISPPL PLSNNPWVES VVTQITDLGD KNVETGQGPA  240
KEASASSTSS ESQGLVLRLN ENPTEHEIGN GSKRPNTSEI AEVVAGQKDD DNHREHDGFE  300
LVSLLVACAD AIASKNILAI NHFIARLGEL ASPRGSPISR LTAYFTEALS LRVARFWPHI  360
FHISPPRDLD RVDDDCGTSL RLLNQVSPIP RFIHFTSNEI LLRAFEGKDR VHIIDFDIKQ  420
GLQWPSLFQS LASRTNPPSH IRITGIGESK QELIETGDRL AGFAEALNLP FEFHPVVDRL  480
EDVRLWMLHV KEKESVAVNC IFQLHKMLYD STGGVLRDFL GLIRSTNPTV VLMGEQEAEH  540
NGPGFETRLT NSLKYYSAIF DSLEESIAAD SPIRMKIEEM FAREIRNIIA CEGQERFERH  600
QGFVKWQKMM EQGGFRSLNT GERELLQSRM LLKMYGSENY KVEKQGEDGA ALTLSWLDQP  660
LYTISAWWSI DVAGSSSSFS QRVD*
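
The listing above groups residues in blocks of ten, ends each full line with a running position counter, and closes with a `*` stop marker. A minimal sketch for turning such a listing into a plain sequence string follows; the function name `clean_sequence` is hypothetical, and the approach assumes the only letters in the block are residue codes.

```python
import re


def clean_sequence(block: str) -> str:
    """Join a numbered, space-grouped protein listing into one string.

    Keeps letters only, which discards the block spacing, the trailing
    position counters (e.g. "660"), and the "*" stop marker.
    """
    return re.sub(r"[^A-Za-z]", "", block).upper()


# Last two lines of the listing above, used as a small sample.
sample = """QGFVKWQKMM EQGGFRSLNT GERELLQSRM LLKMYGSENY KVEKQGEDGA ALTLSWLDQP  660
LYTISAWWSI DVAGSSSSFS QRVD*"""

tail = clean_sequence(sample)
print(len(tail), tail[-4:])  # 84 QRVD
```

Passing the whole listing instead of the sample returns the full protein as a single string, which is the form most alignment and property tools expect.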
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5b3h_A  2e-71    308          667        25         377      Protein SCARECROW
5b3h_D  2e-71    308          667        25         377      Protein SCARECROW
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AP009279  4e-35    AP009279.1 Solanum lycopersicum genomic DNA, chromosome 8, clone: C08HBa0197J17, complete sequence.
GenBank  AP010263  4e-35    AP010263.1 Solanum lycopersicum DNA, chromosome 8, clone: C08SLm0019C20, complete sequence.
GenBank  HG975447  4e-35    HG975447.1 Solanum pennellii chromosome ch08, complete genome.
GenBank  HG975520  4e-35    HG975520.1 Solanum lycopersicum chromosome ch08, complete genome.
Annotation -- Protein
Source     Hit ID                E-value  Description
Refseq     XP_027080070.1        0.0      scarecrow-like protein 28
Refseq     XP_027083311.1        0.0      scarecrow-like protein 28
Refseq     XP_027180274.1        0.0      scarecrow-like protein 28
Swissprot  Q9CAN3                0.0      SCL28_ARATH; Scarecrow-like protein 28
TrEMBL     A0A068UNJ1            0.0      A0A068UNJ1_COFCA; Uncharacterized protein
STRING     PGSC0003DMT400007879  0.0      (Solanum tuberosum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA5568              24           34
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G63100.1  0.0      GRAS family protein