PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID CA05g03110
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family GRAS
Protein Properties Length: 743aa    MW: 83318.8 Da    PI: 6.0418
Description GRAS family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
CA05g03110     genome  PEP     View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    388.5  7.3e-119  370    740  2          374
        GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshltaNq 100
                 ++ L+ cA+av+++d + a+++L+++++++sp+gd mqRla+yf+ +L+ar+a+s++++y al + +ts    ++ l+a++l+  ++P+ k+s++  N+
  CA05g03110 370 RTILTLCAQAVAADDRRTANEFLKQIRQNSSPTGDGMQRLAHYFADGLEARMAGSGTQIYTALISMPTS---AADILKAYQLYLAACPFRKLSNFFSNK 465
                 67899**********************************************************999998...9************************** PP

        GRAS 101 aIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledleleeLrvk 197
                 +I++a+e++++vH+iD++i++G+QWp+++q L+ Rp+gpp+lRiTg++ p++g   +e++eetg+rLa++Ae+++vpfef + +a+++e++++e+L+++
  CA05g03110 466 TIMNAAETASTVHVIDYGIMYGFQWPCFIQRLSRRPGGPPKLRITGIDFPNPGfrPAERVEETGKRLADYAESFNVPFEFIA-IAQKWETIKVEDLKIQ 563
                 ****************************************************9*****************************.7*************** PP

        GRAS 198 pgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgreivn 296
                 +gE+l+Vn++ +  +lldesv ++s+rd vL+++++l+P+v v    +  +n++ F +rf eal +ys++fd+lea++pre  er  vE+ ++gre++n
  CA05g03110 564 KGEVLVVNCMNRFRNLLDESVVINSPRDIVLNFIRKLNPDVYVQGIVNGAYNAPFFITRFREALFHYSSIFDMLEANIPREIPERLLVEKLIFGREAMN 662
                 *************************************************************************************************** PP

        GRAS 297 vvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                 v+ace aer+er et+++W+ r ++aGFk++pl+e++   ak  ++ ++ + + ++ + ++l++gWk+r L+++S W+
  CA05g03110 663 VIACESAERIERPETYKQWQVRNTRAGFKQLPLNEEILRIAKDRVKAYHHKDFIIDVDGHWLLQGWKGRILFTASTWT 740
                 *************************************************888*************************5 PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   68.971    343    720  IPR005202    Transcription factor GRAS
Pfam             PF03514   2.5E-116  370    740  IPR005202    Transcription factor GRAS
Sequence
Protein Sequence    Length: 743 aa
MVMDSRNYMG LYDTTSAIQL KDDANSFFQD LNLMNNLRVS DALVERNRVA VPQFQPDLKP  60
DDVVPSAIDN SHEDYDFSDA VLKYISQMLM EENIEEKACM FQESAALQAA ERSFYEVIGE  120
KYPPSPNEKI LDLGQDVGHC VLDSSSNYYS CGSDVTDGLL CPNWNPDPGD TDSSHNQQFP  180
VDSAFASTSQ SSHSSSSSSG TVTDAHVDSP VSSIHIPDIF SDSESIMQFK KGVEEASKFL  240
PTGNSLLLDV RYNVVVKEDN ENGKDAVENR GKQKSPEGSR RKKNYHHDDF DVMEKRSNKQ  300
SAVSSESPVR SDLFDKVLLC SGGKNESALR ESWQTLSSKH APEDSLPKGS NGRKSRGKKP  360
GGKRGAVDLR TILTLCAQAV AADDRRTANE FLKQIRQNSS PTGDGMQRLA HYFADGLEAR  420
MAGSGTQIYT ALISMPTSAA DILKAYQLYL AACPFRKLSN FFSNKTIMNA AETASTVHVI  480
DYGIMYGFQW PCFIQRLSRR PGGPPKLRIT GIDFPNPGFR PAERVEETGK RLADYAESFN  540
VPFEFIAIAQ KWETIKVEDL KIQKGEVLVV NCMNRFRNLL DESVVINSPR DIVLNFIRKL  600
NPDVYVQGIV NGAYNAPFFI TRFREALFHY SSIFDMLEAN IPREIPERLL VEKLIFGREA  660
MNVIACESAE RIERPETYKQ WQVRNTRAGF KQLPLNEEIL RIAKDRVKAY HHKDFIIDVD  720
GHWLLQGWKG RILFTASTWT GA*
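The sequence block above is formatted for display: residues are grouped in tens, each full line ends with a running position counter, and the final line ends with a `*` stop symbol. A minimal sketch of turning such a block into a plain residue string follows. The function name `clean_sequence` is hypothetical and not part of PlantTFDB; only the last two lines of the record are used as input to keep the example short.

```python
import re


def clean_sequence(block: str) -> str:
    """Return only the residue letters from a formatted sequence block.

    Drops whitespace, the running position counters at line ends,
    and the trailing '*' stop symbol, since none of them are letters.
    """
    letters = re.sub(r"[^A-Za-z]", "", block)
    return letters.upper()


# Last two lines of the record above, copied verbatim.
tail_block = """
MNVIACESAE RIERPETYKQ WQVRNTRAGF KQLPLNEEIL RIAKDRVKAY HHKDFIIDVD  720
GHWLLQGWKG RILFTASTWT GA*
"""

tail = clean_sequence(tail_block)
print(len(tail), tail[-5:])
```

Applied to the full block, the same function gives a string whose length can be compared with the listed length. The last line holds 22 residues after counter 720, so the listed value of 743 may include the stop symbol.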
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  3e-57   376          740        26         379      Protein SCARECROW
5b3h_A  2e-57   376          740        25         378      Protein SCARECROW
5b3h_D  2e-57   376          740        25         378      Protein SCARECROW
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    269    282  RGKQKSPEGSRRKK
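The NLS entry above (start 269, end 282, sequence RGKQKSPEGSRRKK) can be cross-checked against the protein sequence. The sketch below assumes that the counters at the line ends of the sequence block are 1-based residue positions, so the line ending with counter 300 covers residues 241 to 300. The helper `locate` is hypothetical and written only for this check.

```python
# Residues 241-300, copied from the sequence line that ends with counter 300.
SEGMENT_START = 241  # assumed 1-based position of the segment's first residue
segment = "PTGNSLLLDVRYNVVVKEDNENGKDAVENRGKQKSPEGSRRKKNYHHDDFDVMEKRSNKQ"


def locate(motif: str, segment: str, segment_start: int) -> tuple[int, int]:
    """Return the 1-based inclusive (start, end) of motif in the full protein."""
    offset = segment.find(motif)
    if offset < 0:
        raise ValueError("motif not found in segment")
    start = segment_start + offset
    return start, start + len(motif) - 1


nls = "RGKQKSPEGSRRKK"
start, end = locate(nls, segment, SEGMENT_START)
print(start, end)          # 1-based inclusive coordinates
print(start - 1, end - 1)  # the same span expressed as 0-based indices
```

Under that assumption the motif sits at residues 270 to 283 in 1-based numbering, which corresponds to 269 and 282 as 0-based indices. The NLS table therefore appears to use 0-based coordinates, whereas the GRAS domain alignment, which begins at the R of RTILTLCAQAV at position 370, appears to be 1-based.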
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975443  0.0      HG975443.1 Solanum pennellii chromosome ch04, complete genome.
Annotation -- Protein
Source     Hit ID                E-value  Description
Refseq     XP_016572122.1        0.0      PREDICTED: scarecrow-like protein 14
Refseq     XP_016572123.1        0.0      PREDICTED: scarecrow-like protein 14
Refseq     XP_016572124.1        0.0      PREDICTED: scarecrow-like protein 14
Swissprot  Q9XE58                0.0      SCL14_ARATH; Scarecrow-like protein 14
TrEMBL     A0A1U8GLI8            0.0      A0A1U8GLI8_CAPAN; scarecrow-like protein 14
TrEMBL     A0A1U8GTL4            0.0      A0A1U8GTL4_CAPAN; Scarecrow-like protein 14
STRING     PGSC0003DMT400016258  0.0      (Solanum tuberosum)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
AsteridsOGEA27624191
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G07530.1  0.0      SCARECROW-like 14
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]