PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cucsa.180010.1
Common Name Csa_4G061850, LOC101212169
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Cucumis
Family GRAS
Protein Properties Length: 445aa    MW: 48766.2 Da    PI: 6.5058
Description GRAS family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Cucsa.180010.1   genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    384.6  1.2e-117  83     442  3          374
            GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshlt 97 
                      lLl+cAe+v+ ++l++a+ lL ++sel+sp g++ +R+ ayf+ AL+ar+ +s+ ++y+ l+ ++ ++++s++ ++al+ ++ +sP++kfsh+t
  Cucsa.180010.1  83 GLLLQCAECVAIDNLQEANDLLPEISELSSPFGTSPERVGAYFAHALQARVISSCLGTYSPLTIRTLNQTQSQRIFNALQSYNSISPLIKFSHFT 177
                     58********************************************************************************************* PP

            GRAS  98 aNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfefnvlvakrledlele 192
                     aNqaI +a++ge+rvH+iD+d++qGlQWp L++ LasRp++  slRi g+gs    s++ l++tg+rLa+fA++lg+pfef+++  k  +  ++ 
  Cucsa.180010.1 178 ANQAIFQALDGEDRVHVIDLDVMQGLQWPGLFHILASRPKKIQSLRISGFGS----SSDLLQSTGRRLADFATSLGLPFEFHPVEGKIGNLTNPG 268
                     ****************************************************....9**************************7666677779** PP

            GRAS 193 eLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvEr 287
                     +L++++gEa++V++++  h l+d ++s       +L+l+ +l+Pk++++veq+++h ++sFl rf+eal+yysalfd+l  +l+ +s er++vE+
  Cucsa.180010.1 269 QLELRSGEAVVVHWMH--HCLYDVTGSDIG----TLRLLSTLKPKIITIVEQDLSH-GGSFLGRFVEALHYYSALFDALGDSLGMDSIERHVVEQ 356
                     ***************9..888888888888....**********************.789*********************************** PP

            GRAS 288 ellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                     +l+g ei+n++a  g +r+ + + +e+W + l++ GFkp++l+ + a+qa+lll +++++gy++ ee+g+l lgWkd +L+++SaW+
  Cucsa.180010.1 357 QLFGCEIRNIIAVGGPKRTGEVK-VERWGDELKRLGFKPLSLRGNPAAQASLLLGMFPWKGYTLVEENGCLKLGWKDLSLLTASAWQ 442
                     *****************998887.9*************************************************************6 PP
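The coordinates printed beside each alignment row can be checked against the residues in that row. The following is a minimal sketch, assuming the query rows are copied verbatim from the alignment above; the helper names `parse_query_rows` and `check_segments` are illustrative and are not part of PlantTFDB or HMMER. It strips gap characters (`-`) from each query row and confirms that the residue count equals `End - Start + 1` and that consecutive rows are contiguous.

```python
import re

# Query rows copied from the alignment above: name, start, aligned residues, end.
QUERY_ROWS = """\
Cucsa.180010.1  83 GLLLQCAECVAIDNLQEANDLLPEISELSSPFGTSPERVGAYFAHALQARVISSCLGTYSPLTIRTLNQTQSQRIFNALQSYNSISPLIKFSHFT 177
Cucsa.180010.1 178 ANQAIFQALDGEDRVHVIDLDVMQGLQWPGLFHILASRPKKIQSLRISGFGS----SSDLLQSTGRRLADFATSLGLPFEFHPVEGKIGNLTNPG 268
Cucsa.180010.1 269 QLELRSGEAVVVHWMH--HCLYDVTGSDIG----TLRLLSTLKPKIITIVEQDLSH-GGSFLGRFVEALHYYSALFDALGDSLGMDSIERHVVEQ 356
Cucsa.180010.1 357 QLFGCEIRNIIAVGGPKRTGEVK-VERWGDELKRLGFKPLSLRGNPAAQASLLLGMFPWKGYTLVEENGCLKLGWKDLSLLTASAWQ 442
"""

# One row: sequence name, start coordinate, aligned string (letters and gaps), end coordinate.
ROW = re.compile(r"^\s*(\S+)\s+(\d+)\s+([A-Za-z.\-]+)\s+(\d+)\s*$")


def parse_query_rows(text):
    """Return a list of (start, ungapped_residues, end) tuples, one per row."""
    segments = []
    for line in text.splitlines():
        match = ROW.match(line)
        if match:
            _name, start, aligned, end = match.groups()
            segments.append((int(start), aligned.replace("-", ""), int(end)))
    return segments


def check_segments(segments):
    """True if each row's residue count matches its coordinates and rows are contiguous."""
    expected_start = segments[0][0]
    for start, residues, end in segments:
        if start != expected_start or len(residues) != end - start + 1:
            return False
        expected_start = end + 1
    return True


segments = parse_query_rows(QUERY_ROWS)
domain = "".join(residues for _start, residues, _end in segments)
print(check_segments(segments), segments[0][0], segments[-1][2], len(domain))
```

The first and last coordinates recovered this way correspond to the Start and End columns of the Signature Domain table.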

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   55.992    55     422  IPR005202    Transcription factor GRAS
Pfam             PF03514   4.1E-115  83     442  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0048366  Biological Process  leaf development
GO:0090610  Biological Process  bundle sheath cell fate specification
GO:0005634  Cellular Component  nucleus
Sequence
Protein Sequence    Length: 445 aa
MLRSLIPHSP INSTSNNSPS SMKSKRSDHH LDPDSNPSTA ADSSPDHPPS KRLNSCHDHD  60
HDHDHDPPPP LDPTDSTGLR LLGLLLQCAE CVAIDNLQEA NDLLPEISEL SSPFGTSPER  120
VGAYFAHALQ ARVISSCLGT YSPLTIRTLN QTQSQRIFNA LQSYNSISPL IKFSHFTANQ  180
AIFQALDGED RVHVIDLDVM QGLQWPGLFH ILASRPKKIQ SLRISGFGSS SDLLQSTGRR  240
LADFATSLGL PFEFHPVEGK IGNLTNPGQL ELRSGEAVVV HWMHHCLYDV TGSDIGTLRL  300
LSTLKPKIIT IVEQDLSHGG SFLGRFVEAL HYYSALFDAL GDSLGMDSIE RHVVEQQLFG  360
CEIRNIIAVG GPKRTGEVKV ERWGDELKRL GFKPLSLRGN PAAQASLLLG MFPWKGYTLV  420
EENGCLKLGW KDLSLLTASA WQPT*
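The sequence is printed in blocks of ten residues with a running position counter at the end of each full line and a terminal `*`. The sketch below, with hypothetical helper names `parse_sequence` and `subsequence`, shows one way to turn that layout into a plain string and to cut out the GRAS domain using the 1-based, inclusive Start and End values (83 and 442) from the Signature Domain table. Note that the block holds 444 residue letters plus the trailing `*`.

```python
# Sequence block copied from the record above, including position counters and the final '*'.
SEQ_BLOCK = """\
MLRSLIPHSP INSTSNNSPS SMKSKRSDHH LDPDSNPSTA ADSSPDHPPS KRLNSCHDHD  60
HDHDHDPPPP LDPTDSTGLR LLGLLLQCAE CVAIDNLQEA NDLLPEISEL SSPFGTSPER  120
VGAYFAHALQ ARVISSCLGT YSPLTIRTLN QTQSQRIFNA LQSYNSISPL IKFSHFTANQ  180
AIFQALDGED RVHVIDLDVM QGLQWPGLFH ILASRPKKIQ SLRISGFGSS SDLLQSTGRR  240
LADFATSLGL PFEFHPVEGK IGNLTNPGQL ELRSGEAVVV HWMHHCLYDV TGSDIGTLRL  300
LSTLKPKIIT IVEQDLSHGG SFLGRFVEAL HYYSALFDAL GDSLGMDSIE RHVVEQQLFG  360
CEIRNIIAVG GPKRTGEVKV ERWGDELKRL GFKPLSLRGN PAAQASLLLG MFPWKGYTLV  420
EENGCLKLGW KDLSLLTASA WQPT*
"""


def parse_sequence(block):
    """Join the residue blocks, skipping position counters and the trailing '*'."""
    parts = []
    for line in block.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # running position counter at the end of a line
            parts.append(token)
    return "".join(parts).rstrip("*")


def subsequence(seq, start, end):
    """Return residues start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]


protein = parse_sequence(SEQ_BLOCK)
gras_domain = subsequence(protein, 83, 442)
print(len(protein), len(gras_domain), gras_domain[:8], gras_domain[-4:])
```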
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  1e-168  75           443        13         380      Protein SCARECROW
5b3h_A  1e-168  75           443        12         379      Protein SCARECROW
5b3h_D  1e-168  75           443        12         379      Protein SCARECROW
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  LN681873  0.0      LN681873.1 Cucumis melo genomic scaffold, anchoredscaffold00027.
GenBank  LN713261  0.0      LN713261.1 Cucumis melo genomic chromosome, chr_7.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_004147738.1  0.0      PREDICTED: scarecrow-like protein 23
Swissprot  Q9FHZ1          0.0      SCL23_ARATH; Scarecrow-like protein 23
TrEMBL     A0A0A0KVE6      0.0      A0A0A0KVE6_CUCSA; Uncharacterized protein
STRING     XP_004166797.1  0.0      (Cucumis sativus)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids   OGEF20683489
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G41920.1  0.0      GRAS family protein
Publications
  1. Ren Y, et al.
    An integrated genetic and cytogenetic map of the cucumber genome.
    PLoS ONE, 2009. 4(6): p. e5795
    [PMID:19495411]
  2. Guo S, et al.
    Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types.
    BMC Genomics, 2010. 11: p. 384
    [PMID:20565788]
  3. Li Z, et al.
    RNA-Seq improves annotation of protein-coding genes in the cucumber genome.
    BMC Genomics, 2011. 12: p. 540
    [PMID:22047402]
  4. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]
  5. Yoon EK, et al.
    Conservation and Diversification of the SHR-SCR-SCL23 Regulatory Network in the Development of the Functional Endodermis in Arabidopsis Shoots.
    Mol Plant, 2016. 9(8): p. 1197-1209
    [PMID:27353361]