PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cucsa.193020.1
Common Name Csa_5G140530, LOC101214171
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Cucumis
Family GRAS
Protein Properties Length: 599 aa    MW: 65372.5 Da    pI: 6.2871
Description GRAS family protein
Gene Model
Gene Model ID     Type      Source    Coding Sequence
Cucsa.193020.1    genome    JGI       View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1      GRAS      354.2    2e-108     234      598    3            374
            GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshlt 97 
                     + ++e+A+a+s+g+le  +++La + ++++ +g+++qRla y++ AL++r++      ++ +pp   +    +e+ aa++l+++vsP++k+++++
  Cucsa.193020.1 234 QSVIEAATAISDGKLEGLDEILAPVVKISNARGNSVQRLAEYMVLALKSRVNP-----VE-FPPPVVE-IYGDEHSAATQLLYDVSPCFKLAFMA 321
                     6789************************************************9.....33.3332222.34999********************* PP

            GRAS  98 aNqaIleavege.ervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfefnvlvakrledlel 191
                     aN aIlea+ +e +++H++Dfdi++G+Q+++L++ L+ R++g+ ++++T+v+  e+g  e+l+ +ge L+++A+elgv f+fn+ v ++l++l+ 
  Cucsa.193020.1 322 ANLAILEAIGEEdRKLHVVDFDIGKGGQYMNLIHLLSGRQKGKVTVKLTAVVT-ENGGDESLKLVGESLTQLANELGVGFNFNI-VRHKLAELTR 414
                     *********9997888*************************************.77899************************9.799******* PP

            GRAS 192 eeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvE 286
                     e+L ++ +E+laVn++++l+r++desvs+e++rde+L+ vksl P+vv+v+eqe+++n+++F++r++e++ yys+lfds+++++ r++++r+kvE
  Cucsa.193020.1 415 ESLGCELDESLAVNFAFKLYRMPDESVSTENPRDELLRRVKSLAPTVVTVMEQELNMNTAPFVARVTESCTYYSSLFDSIDSTVQRHHSDRVKVE 509
                     *********************************************************************************************** PP

            GRAS 287 rellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvk..sdgyrveeesgslvlgWkdrpLvsvSaWr 374
                     +  lgr++ n +aceg++r+er+e  +kWr+r+++aGF++ ++s+++a+++k+ l+  +  + g++v+ee+g +++gW++r+L++++aWr
  Cucsa.193020.1 510 EG-LGRKLANSLACEGRDRVERCEVSGKWRARMGMAGFEARSMSQTVAESMKTRLSSGYrvNPGFTVKEENGGICFGWMGRTLTVTTAWR 598
                     **.*************************************************99887655889**************************8 PP

Protein Features
Database           Entry ID    E-value     Start    End    InterPro ID    Description
PROSITE profile    PS50985     47.539      206      576    IPR005202      Transcription factor GRAS
Pfam               PF03514     6.9E-106    234      598    IPR005202      Transcription factor GRAS
Gene Ontology
GO Term       GO Category           GO Description
GO:0006355    Biological Process    regulation of transcription, DNA-templated
GO:0005634    Cellular Component    nucleus
GO:0005737    Cellular Component    cytoplasm
Sequence
Protein Sequence    Length: 599 aa
MQSGFTGGGA PDFYTAGRSM SSNPSQQNAY RSQLSGGSFI DPAATQIARQ IPSSLLGKRN  60
LADLHSHNQH NHHNLPLNNL FLRSVKPRAF NHPISSLSNL DFYSTMTLPS PDVQAHRLYG  120
TSSGALLQQL RQQPNGGIPV RDLQSLESEK KMMNHRLQEL EKELLEDNDD DDGSDAVSVI  180
TSSNSAWCET IYNLISPNPS PTAQNPSPTS SASSSCSSST SSSVASPASD SWKQSVIEAA  240
TAISDGKLEG LDEILAPVVK ISNARGNSVQ RLAEYMVLAL KSRVNPVEFP PPVVEIYGDE  300
HSAATQLLYD VSPCFKLAFM AANLAILEAI GEEDRKLHVV DFDIGKGGQY MNLIHLLSGR  360
QKGKVTVKLT AVVTENGGDE SLKLVGESLT QLANELGVGF NFNIVRHKLA ELTRESLGCE  420
LDESLAVNFA FKLYRMPDES VSTENPRDEL LRRVKSLAPT VVTVMEQELN MNTAPFVARV  480
TESCTYYSSL FDSIDSTVQR HHSDRVKVEE GLGRKLANSL ACEGRDRVER CEVSGKWRAR  540
MGMAGFEARS MSQTVAESMK TRLSSGYRVN PGFTVKEENG GICFGWMGRT LTVTTAWR*
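
The listing above is grouped in blocks of ten residues, with a running position counter at the end of each full line and a trailing `*` after the last residue. A minimal sketch of how such a listing could be joined into a single residue string is shown below; the function name `parse_grouped_sequence` and the shortened two-line excerpt (with its counter adjusted to match) are illustrative assumptions, not part of PlantTFDB.

```python
def parse_grouped_sequence(block: str) -> str:
    """Join a grouped protein listing into one contiguous residue string."""
    residues = []
    for line in block.splitlines():
        for token in line.split():
            # Purely numeric tokens are the running position counters
            # printed at the end of each line, not sequence data.
            if token.isdigit():
                continue
            residues.append(token)
    # Assumption: a trailing '*' marks the translated stop codon
    # rather than an amino acid, so it is dropped here.
    return "".join(residues).rstrip("*")


# Shortened excerpt for illustration only (first two groups and last group).
excerpt = """MQSGFTGGGA PDFYTAGRSM  20
LTVTTAWR*"""
print(parse_grouped_sequence(excerpt))  # MQSGFTGGGAPDFYTAGRSMLTVTTAWR
```

Whether the `*` should be kept depends on the downstream tool, so the `rstrip` call is the line to adjust if the terminator needs to be preserved.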
3D Structure
PDB ID    E-value    Query Start    Query End    Hit Start    Hit End    Description
5hyz_A    7e-44      255            598          26           375        GRAS family transcription factor containing protein, expressed
Functional Description
Source     Description
UniProt    Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    LN681895    0.0        LN681895.1 Cucumis melo genomic scaffold, anchoredscaffold00005.
GenBank    LN713263    0.0        LN713263.1 Cucumis melo genomic chromosome, chr_9.
Annotation -- Protein
Source       Hit ID            E-value    Description
Refseq       XP_011654649.1    0.0        PREDICTED: scarecrow-like protein 8
Swissprot    Q9FYR7            1e-166     SCL8_ARATH; Scarecrow-like protein 8
TrEMBL       A0A0A0KK19        0.0        A0A0A0KK19_CUCSA; Uncharacterized protein
STRING       XP_008437524.1    0.0        (Cucumis melo)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
FabidsOGEF62663151
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT5G52510.1    1e-147     SCARECROW-like 8
Publications
  1. Ren Y, et al.
    An integrated genetic and cytogenetic map of the cucumber genome.
    PLoS ONE, 2009. 4(6): p. e5795
    [PMID:19495411]
  2. Guo S, et al.
    Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types.
    BMC Genomics, 2010. 11: p. 384
    [PMID:20565788]
  3. Li Z, et al.
    RNA-Seq improves annotation of protein-coding genes in the cucumber genome.
    BMC Genomics, 2011. 12: p. 540
    [PMID:22047402]