PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID CA10g04850
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family GRAS
Protein Properties Length: 769aa    MW: 86774.8 Da    PI: 6.5228
Description GRAS family protein
Gene Model
Gene Model ID    Type    Source    Coding Sequence
CA10g04850    genome    PEP    View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1    GRAS    373.3    3.1e-114    394    766    1    374
        GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshltaN 99 
                 l++lL++cA+av+++d + a++lL++++ ++sp gd +qRla +f+ +L+arla+++s++ykal  ++ts    ++ l+a++l+   +P+ k+s +t N
  CA10g04850 394 LRTLLINCAQAVAADDCRSATELLKQIRRHSSPFGDGNQRLAYCFADGLEARLAGTGSQIYKALINKKTS---AADFLKAFHLYLASCPFRKISGFTSN 489
                 6789************************************************************999998...8************************* PP

        GRAS 100 qaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledleleeLrv 196
                 ++I   + +++rvHiiDf+i +G+QWp+L+q +a+R++gpp+lRiTg++ p++g   +e++eetg+rL+++A++++vpfe+++ +ak++e+++le+L++
  CA10g04850 490 KTIITKARNASRVHIIDFGILYGFQWPTLIQRIAAREGGPPRLRITGIEFPQPGfrPAERIEETGRRLSDYAKSFNVPFEYQA-IAKKWETIRLEDLKL 587
                 *****************************************************9*****************************.7************** PP

        GRAS 197 kpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgreiv 295
                 +++E l+Vn+ ++ ++l+de+v  es r+ vL+l+++++P+++++   +  ++++ F++rf e l ++salfd+lea++pre  er+ +Ere++gre+ 
  CA10g04850 588 EKDEFLVVNCLYRFKNLHDETVLAESSRTLVLNLIREINPDIFIHGIVNGAYSAPFFVTRFREVLFHFSALFDMLEANVPREFPERMLIEREIFGREAL 686
                 *************************************************************************************************** PP

        GRAS 296 nvvacegaerrerhetlekWrerleeaGFkpvplsek.aakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                 nv+aceg er+er et+++W+ rl +a F+++p++++ +++ a   +r+ + + + +++++++++lgWk+r+++++S+W+
  CA10g04850 687 NVIACEGWERVERPETYKQWQVRLLRARFTQIPFDQQgIMNMAIEKVRTSYHKDFVIDQDNKWMLLGWKGRTIYALSCWT 766
                 **********************************98835666677888888888*************************6 PP

Protein Features
Database    Entry ID    E-value    Start    End    InterPro ID    Description
PROSITE profile    PS50985    64.567    368    745    IPR005202    Transcription factor GRAS
Pfam    PF03514    1.1E-111    394    766    IPR005202    Transcription factor GRAS
Sequence
Protein Sequence    Length: 769 aa
MDPRFNRFPS SVNGFNLENQ SLLDFSGNWK NKEPIIGAIY PDQILANGPR FENSFVQHNV  60
GGVQSFSNAP SPTSNIALTS KIGIEDDYNE DFDFSDTVLS YINQMLMEED MEDKTHMLQE  120
SLELQAKERS FYEALGKKYP PSPQQNLSIT DQNGEILDDY CSGSLYSCTS NVGSSGGYLI  180
DPRVDSIPND HNSSYEQGIS ICNGTYSSIS SSNSINNLVD GFLDSPVGPL HIPDIYNDSH  240
PIWNFRKGVE EATKFLPTNN KLLDNVIIND LLPQEKRGES GCAATQVEKR DGGGTSLTGP  300
RGRKNAHRDD KDLEEERSSK QAAVYTESTV RSEEFDVVLL HSMGDGREAL TAYRESLKNA  360
RTKTTAQNGQ SKGFTVGKGG RGKKQSAKKE VIDLRTLLIN CAQAVAADDC RSATELLKQI  420
RRHSSPFGDG NQRLAYCFAD GLEARLAGTG SQIYKALINK KTSAADFLKA FHLYLASCPF  480
RKISGFTSNK TIITKARNAS RVHIIDFGIL YGFQWPTLIQ RIAAREGGPP RLRITGIEFP  540
QPGFRPAERI EETGRRLSDY AKSFNVPFEY QAIAKKWETI RLEDLKLEKD EFLVVNCLYR  600
FKNLHDETVL AESSRTLVLN LIREINPDIF IHGIVNGAYS APFFVTRFRE VLFHFSALFD  660
MLEANVPREF PERMLIEREI FGREALNVIA CEGWERVERP ETYKQWQVRL LRARFTQIPF  720
DQQGIMNMAI EKVRTSYHKD FVIDQDNKWM LLGWKGRTIY ALSCWTPI*
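
The sequence block above is laid out in space-separated groups of ten residues, with a running position count at the end of each full line and a trailing `*` after the last residue. A minimal sketch of how such a block could be joined into one contiguous string is given here; the function name `parse_sequence_block` is hypothetical and not part of PlantTFDB, and treating `*` as a stop marker rather than a residue is an assumption.

```python
def parse_sequence_block(block: str) -> str:
    """Join a grouped, position-numbered sequence block into one string."""
    residues = []
    for line in block.strip().splitlines():
        # Keep the letter groups; all-digit tokens are running position counts.
        residues.extend(tok for tok in line.split() if not tok.isdigit())
    # Assumption: a trailing '*' marks the stop and is not a residue.
    return "".join(residues).rstrip("*")


# First and last lines of the record, used here only as a short illustration.
example = """
MDPRFNRFPS SVNGFNLENQ SLLDFSGNWK NKEPIIGAIY PDQILANGPR FENSFVQHNV  60
DQQGIMNMAI EKVRTSYHKD FVIDQDNKWM LLGWKGRTIY ALSCWTPI*
"""

seq = parse_sequence_block(example)
print(len(seq), seq[:10], seq[-8:])
```

Applied to the full block, the resulting residue count can be compared with the length listed in the record header.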
3D Structure
Structure
PDB ID    Evalue    Query Start    Query End    Hit Start    Hit End    Description
5b3h_A    4e-50    383    767    7    379    Protein SCARECROW
5b3h_D    4e-50    383    767    7    379    Protein SCARECROW
Functional Description
Source    Description
UniProt    Probable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Nucleotide
Source    Hit ID    E-value    Description
GenBank    HG975449    0.0    HG975449.1 Solanum pennellii chromosome ch10, complete genome.
Annotation -- Protein
Source    Hit ID    E-value    Description
Refseq    XP_016553556.1    0.0    PREDICTED: scarecrow-like protein 9
Swissprot    O80933    0.0    SCL9_ARATH; Scarecrow-like protein 9
TrEMBL    A0A2G2YL99    0.0    A0A2G2YL99_CAPAN; Scarecrow-like protein 9
STRING    Solyc10g086530.1.1    0.0    (Solanum lycopersicum)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Asterids    OGEA276    24    191
Best hit in Arabidopsis thaliana
Hit ID    E-value    Description
AT2G37650.1    0.0    GRAS family protein