PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa04g051280.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family GRAS
Protein Properties Length: 856aa    MW: 96417.1 Da    PI: 5.8005
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa04g051280.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS310.24.8e-953516511305
            GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfsh 95 
                     l++lL++cA+av+++d ++a +lL++++++++p gd +qRla++f+++L+arla+++s++yk + +++ s    + +l+a++lf  ++P+ k+s+
  Csa04g051280.1 351 LRSLLIHCAQAVAADDRRCAGQLLKQIRQHSTPFGDGNQRLAHCFANGLEARLAGTGSQIYKGIVSKPRS---AAAVLKAHQLFLACCPFRKLSY 442
                     689***************************************************************9999...999******************* PP

            GRAS  96 ltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrled 188
                     +++N++I + v++++rvH+iDf+i +G+QWp+L++ ++    g+p++RiTg++ p++g   ++++eetg+rLa +A+++gvpfe+++ +ak++++
  Csa04g051280.1 443 FITNKTIRDLVDNSQRVHVIDFGILYGFQWPTLIHRFSM--YGSPKVRITGIEFPQPGfrPAQRVEETGQRLAAYAKHFGVPFEYKA-IAKKWDA 534
                     **************************************9..8***************9*****************************.7****** PP

            GRAS 189 leleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseeri 283
                     ++le+L+++++E+++Vn+ ++ ++l+desv++es+rd+vL+l+ +++P+++v+   +  +n++ F++rf eal ++s++fd+le+ +pre+e ri
  Csa04g051280.1 535 IQLEDLDIDRDEVIIVNCLYRAENLHDESVKVESCRDTVLNLIGKINPDLFVFGIVNGAYNAPFFVTRFREALFHFSSIFDMLETIVPREDEGRI 629
                     *********************************************************************************************** PP

            GRAS 284 kvErellgreivnvvacegaer 305
                     ++E e++gre+ nv+aceg er
  Csa04g051280.1 630 FLEMEVFGREALNVIACEGWER 651
                     ********************98 PP

2GRAS183.81.2e-56653850176374
            GRAS 176 fefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfds 270
                     fe+++ +ak++++++le+L+++++E+++Vn+ ++ ++l+desv++es+rd+vL+l+ +++P+++v+   +  +n++ F++rf eal ++s++fd+
  Csa04g051280.1 653 FEYKA-IAKKWDAIQLEDLDIDRDEVIIVNCLYRAENLHDESVKVESCRDTVLNLIGKINPDLFVFGIVNGAYNAPFFVTRFREALFHFSSIFDM 746
                     9****.7**************************************************************************************** PP

            GRAS 271 leaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdr 365
                     le+ +pre+e r+++E e++gre+ nv+aceg er+er et+++W+ r +++G+ +vp++ +++k+a + +++++ + + +++++ +l++gWk+r
  Csa04g051280.1 747 LETIVPREDEGRMFLEMEVFGREALNVIACEGWERVERPETYKQWHVRAMRSGLVQVPFDPSIMKTALHKVNTFYHKDFVIDQDNRWLLQGWKGR 841
                     ***************************************************************************889***************** PP

            GRAS 366 pLvsvSaWr 374
                     +++++S+W+
  Csa04g051280.1 842 TVMALSVWK 850
                     ********8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098559.462325830IPR005202Transcription factor GRAS
PfamPF035141.7E-92351651IPR005202Transcription factor GRAS
PfamPF035144.2E-54653850IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 856 aa     Download sequence    Send to blast
MITEPGLTGI SAAITRNRLP GLPDQSTSLL PNHTFTPVSL YDSFNYNLSS SDHRNPPENS  60
VFVREVVVGE EEDPADDFDF SDAVLGYISQ MLNEEDMDDK VCMLQESLDL EAAERSLYEA  120
IGKKYPPSPE RNLVASFAER NGENLDRVVP GNYTAGDCIG FGNGVIKPMS GGFTIDFRNP  180
QSRCSSVLSV PQSNGFGMDQ SSKKSLYSDS NGESHQSVWL FRGGIEEASR FLPEQNELIV  240
NFREESCMSR GRKNSSRDEV CVEEERSSKL PAVFGEDILR SDVVDKILVH VPGEESMKEF  300
DALREVLKKG VEKKKASVAQ GGKRRARGRG RGRGGGGGGG GQNGKKEVVD LRSLLIHCAQ  360
AVAADDRRCA GQLLKQIRQH STPFGDGNQR LAHCFANGLE ARLAGTGSQI YKGIVSKPRS  420
AAAVLKAHQL FLACCPFRKL SYFITNKTIR DLVDNSQRVH VIDFGILYGF QWPTLIHRFS  480
MYGSPKVRIT GIEFPQPGFR PAQRVEETGQ RLAAYAKHFG VPFEYKAIAK KWDAIQLEDL  540
DIDRDEVIIV NCLYRAENLH DESVKVESCR DTVLNLIGKI NPDLFVFGIV NGAYNAPFFV  600
TRFREALFHF SSIFDMLETI VPREDEGRIF LEMEVFGREA LNVIACEGWE RXFEYKAIAK  660
KWDAIQLEDL DIDRDEVIIV NCLYRAENLH DESVKVESCR DTVLNLIGKI NPDLFVFGIV  720
NGAYNAPFFV TRFREALFHF SSIFDMLETI VPREDEGRMF LEMEVFGREA LNVIACEGWE  780
RVERPETYKQ WHVRAMRSGL VQVPFDPSIM KTALHKVNTF YHKDFVIDQD NRWLLQGWKG  840
RTVMALSVWK PESKA*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A5e-3835765625316Protein SCARECROW
5b3h_A4e-3835765624315Protein SCARECROW
5b3h_D4e-3835765624315Protein SCARECROW
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1324333RARGRGRGRG
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapCsa04g051280.1
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0046840.0AC004684.3 Arabidopsis thaliana chromosome 2 clone F13M22 map ve018, complete sequence.
GenBankCP0026850.0CP002685.1 Arabidopsis thaliana chromosome 2, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010505361.10.0PREDICTED: scarecrow-like protein 9 isoform X1
SwissprotO809330.0SCL9_ARATH; Scarecrow-like protein 9
TrEMBLR0HVC00.0R0HVC0_9BRAS; Uncharacterized protein
STRINGXP_010505361.10.0(Camelina sativa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM35827189
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G37650.10.0GRAS family protein