PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cagra.18121s0006.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family GRAS
Protein Properties Length: 732aa    MW: 81932.7 Da    PI: 6.6783
Description GRAS family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
Cagra.18121s0006.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    372.1  7.3e-114  357    726  1          374
                  GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsP 89 
                           l++lL++cA+av+++d ++a +lL++++ ++++ gd +qRla++f+++L+arla+++s++yk + +++ s    + +l+a++lf  ++P
  Cagra.18121s0006.1.p 357 LRSLLIHCAQAVAADDRRCAGQLLQQIRLHSTQFGDGNQRLAHCFANGLEARLAGTGSQIYKGIVSKPRS---AAAVLKAHQLFLACCP 442
                           689***************************************************************9999...999************* PP

                  GRAS  90 ilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpf 176
                           + k+s++++N++I + v+++ rvH+iDf+i +G+QWp+L++ +++   g+p++RiTg++ p++g   ++++eetg+rLa +A+++gvpf
  Cagra.18121s0006.1.p 443 FRKLSYFITNKTIRDLVDKSPRVHVIDFGILYGFQWPTLIHRFSK--YGSPKVRITGIEFPQPGfrPAQRVEETGQRLAAYAKHFGVPF 529
                           ********************************************9..8***************9************************* PP

                  GRAS 177 efnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyys 265
                           e+++ +ak++++++le+L+++++E+++Vn+ ++ ++l+desv++es+rd+vL+l+ +++P+++v+   +  +n++ F++rf eal ++s
  Cagra.18121s0006.1.p 530 EYKA-IAKKWDAIQLEDLDIDRDEVTIVNCLYRAENLHDESVKVESCRDTVLNLIGKINPDLFVFGIVNGAYNAPFFVTRFREALFHFS 617
                           ****.7*********************************************************************************** PP

                  GRAS 266 alfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveee 354
                           ++fd+le+ +p e+e r+ +E e++gre+ nv+aceg er+er et+++W+ r +++G+ +vp++++++k+a   +++++ + + ++++
  Cagra.18121s0006.1.p 618 SIFDMLETIVPGEDEGRMLLEMEVFGREALNVIACEGWERVERPETYKQWHVRAMRSGLVQVPFDSSIMKTALQKVHTFYHKDFVIDQD 706
                           ********************************************************************************888****** PP

                  GRAS 355 sgslvlgWkdrpLvsvSaWr 374
                           +++l++gWk+r+++++S+W+
  Cagra.18121s0006.1.p 707 NKWLLQGWKGRTVMALSVWK 726
                           *******************8 PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   60.985    331    706  IPR005202    Transcription factor GRAS
Pfam             PF03514   2.5E-111  357    726  IPR005202    Transcription factor GRAS
Sequence
Protein Sequence    Length: 732 aa
MITEPGLTGI SGAVNRNRLP GLPDQSTSLL PNHTFTPVTL YDGFNFNLSS DRRNTVLAAP  60
PENSVFIREM EEEEDPADDF DFSDAVLGYI SQMLNEEDMD DKVCMLQESL DLEAAERSLY  120
EAIGKKYPPS PERNLASSFA GENLDRVVPG NYTGGDCIGF GDGGNKPLSG GFTLDFRNPQ  180
SGRSSLLSVP QSNGLVTTSS IYGDGIDESS KNTLYSDSNR ESHQSVWLFR RGIEETSRFL  240
PQQNELIVNF REESCMSRGR KNSSRDETCV EEERSSKLPA VFGEDILRSD VVDKILVHVP  300
GEESMKEFDA LREVLKNGVE KKKASVAQGG KRRPRGRGRG RGRGGGGQNG KKEVVDLRSL  360
LIHCAQAVAA DDRRCAGQLL QQIRLHSTQF GDGNQRLAHC FANGLEARLA GTGSQIYKGI  420
VSKPRSAAAV LKAHQLFLAC CPFRKLSYFI TNKTIRDLVD KSPRVHVIDF GILYGFQWPT  480
LIHRFSKYGS PKVRITGIEF PQPGFRPAQR VEETGQRLAA YAKHFGVPFE YKAIAKKWDA  540
IQLEDLDIDR DEVTIVNCLY RAENLHDESV KVESCRDTVL NLIGKINPDL FVFGIVNGAY  600
NAPFFVTRFR EALFHFSSIF DMLETIVPGE DEGRMLLEME VFGREALNVI ACEGWERVER  660
PETYKQWHVR AMRSGLVQVP FDSSIMKTAL QKVHTFYHKD FVIDQDNKWL LQGWKGRTVM  720
ALSVWKPVSK A*
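
The listing above prints the protein in blocks of ten residues, with a running position count at the end of each full line and `*` marking the stop. Below is a minimal Python sketch, assuming exactly that layout, of one way to collapse such a listing into a plain residue string for downstream tools. The function name `parse_blocked_sequence` is hypothetical and not part of PlantTFDB.

```python
def parse_blocked_sequence(text):
    """Collapse a blocked protein listing into one residue string.

    Assumptions (taken from the layout of this record, not from any
    PlantTFDB specification): tokens are whitespace-separated, purely
    numeric tokens are running position counts, and '*' marks the stop.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # skip the running position count
            residues.append(token.rstrip("*"))  # drop the stop marker
    return "".join(residues)


# The last two lines of the listing above, used as a small example.
example = """\
PETYKQWHVR AMRSGLVQVP FDSSIMKTAL QKVHTFYHKD FVIDQDNKWL LQGWKGRTVM  720
ALSVWKPVSK A*
"""
tail = parse_blocked_sequence(example)
print(len(tail), tail[-11:])
```

Applied to the full listing, the same function yields a single string that can be sliced by position or written out in FASTA form.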
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  2e-41    363          729        25         382      Protein SCARECROW
5b3h_A  2e-41    363          729        24         381      Protein SCARECROW
5b3h_D  2e-41    363          729        24         381      Protein SCARECROW
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    332    342  RPRGRGRGRGR
2    334    343  RGRGRGRGRG
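
The Start and End values in the NLS table can be checked against the protein sequence in this record. Counting residues from 1, as the sequence listing and the domain coordinates do, the two motifs fall one position later than the table states. This suggests that the NLS coordinates are 0-based, although the record does not say which convention it uses. The Python sketch below locates both motifs in residues 321-360 of the listed sequence; the helper `locate` and the variable names are hypothetical.

```python
# Residues 321-360 of the protein sequence in this record (1-based),
# copied from the sequence listing with the block spacing removed.
segment = "KKKASVAQGGKRRPRGRGRGRGRGGGGQNGKKEVVDLRSL"
segment_start = 321  # 1-based position of the first residue in `segment`


def locate(motif, seq, offset):
    """Return the 1-based (start, end) of the first match of `motif`.

    `offset` is the 1-based position of seq[0] in the full protein.
    """
    idx = seq.find(motif)
    if idx < 0:
        raise ValueError(f"motif {motif!r} not found")
    start = offset + idx
    return start, start + len(motif) - 1


nls1 = locate("RPRGRGRGRGR", segment, segment_start)  # (333, 343)
nls2 = locate("RGRGRGRGRG", segment, segment_start)   # (335, 344)

# Subtracting 1 from each value reproduces the table (332-342, 334-343),
# which is what a 0-based convention would give. This is an inference
# from this one record, not documented behaviour.
print(nls1, nls2)
```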
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Cagra.18121s0006.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AC004684  0.0      AC004684.3 Arabidopsis thaliana chromosome 2 clone F13M22 map ve018, complete sequence.
GenBank  CP002685  0.0      CP002685.1 Arabidopsis thaliana chromosome 2, complete sequence.
Annotation -- Protein
Source     Hit ID                E-value  Description
Refseq     XP_006296472.1        0.0      scarecrow-like protein 9 isoform X1
Swissprot  O80933                0.0      SCL9_ARATH; Scarecrow-like protein 9
TrEMBL     R0HVC0                0.0      R0HVC0_9BRAS; Uncharacterized protein
STRING     Cagra.18121s0006.1.p  0.0      (Capsella grandiflora)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM35827189
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G37650.1  0.0      GRAS family protein