PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID 901864
Common NameARALYDRAFT_901864
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis
Family GRAS
Protein Properties Length: 1322aa    MW: 150316 Da    PI: 5.4998
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
901864genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS366.63.3e-1123216941373
    GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshltaNqaIl 103
             l++ L++cA+avs +d + a+ lL+++++++s++gd ++Rla+yf++ L+arla+ ++++y al++++ts   +s+ l+a++++  v+P+ k++ + aN+ I+
  901864 321 LRTMLVSCAQAVSINDRRTADDLLSQIRQHSSSYGDGTERLAHYFANSLEARLAGIGTQVYTALSSKKTS---TSDMLKAYQTYISVCPFKKIAIIFANHSIM 420
             5789**************************************************************9999...89***************************9 PP

    GRAS 104 eavege..ervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledleleeLrvkpgEal 202
             + ++++  +++HiiDf+is+G+QWp+L++ La R++++ +lRiTg++ p+ g   +e + etg+rLak++++++vpfe+n+ +a+++e+++le+L++k+gE +
  901864 421 RLASTAnaKTIHIIDFGISYGFQWPSLIHRLAWRRGSSCKLRITGIELPQRGfrPAEGVIETGHRLAKYCQKFNVPFEYNA-IAQKWETIKLEDLKLKEGEFV 522
             9877766699****************************************99*****************************.7******************** PP

    GRAS 203 aVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaer 305
             aVn+ ++  +llde+v+++s+rd+vLkl+++++P+v++      ++n++ F++rf e l +ys+lfd+ +++l re+  r++ E+e+ grei+nvvaceg+er
  901864 523 AVNSLFRFRNLLDETVAVHSPRDTVLKLIRKIKPDVFIPGILSGSYNAPFFVTRFREVLFHYSSLFDMCDTNLTREDPMRVMFEKEFYGREIMNVVACEGTER 625
             ******************************************************************************************************* PP

    GRAS 306 rerhetlekWrerleeaGFkpvplsekaakqaklllrkvk.sdgyrveeesgslvlgWkdrpLvsvSaW 373
             +er e++++W++r ++aGF+++pl+++ +++ kll++  +  +++ v+++ ++l++gWk+r ++  S+W
  901864 626 VERPESYKQWQARAMRAGFRQIPLDKELVQKLKLLVESGYkTKEFDVDQDCHWLLQGWKGRIVYGSSVW 694
             ************************************99888999************************* PP

2GRAS348.61e-10694113182373
    GRAS    2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlar.svselykalppsetsekn.sseelaalklfsevsPilkfshltaNq 100 
              ++lL++cA+a+s+gd++ a  +L ++++++sp gd+ qRla++f++AL+arl++ +++ + +  ++ +ts k+  +++l+a++++   sP++++ ++   +
  901864  941 RTLLTHCAQAISTGDKTTALDFLLQIRQQSSPLGDAGQRLAHCFANALEARLQGsTGPMIQNYYNAITTSLKDtAADTLKAYRVYLSSSPFVTLMYFFSIR 1041
              689***************************************************5555555555555555444499************************* PP

    GRAS  101 aIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledleleeLrvkpg 199 
              +Ile ++ +  +Hi+Df+i +G+QWp ++q ++ R++ p +lRiTg++ p+ g   +e++eetg+rLa+++++++vpfe+++++++++e++ +e+L+++p+
  901864 1042 MILEVAKDAPVLHIVDFGILYGFQWPMFIQYISGRNDVPRKLRITGIELPQCGfrPAERIEETGRRLAEYCKRFNVPFEYKAIASQNWETIGIEDLDIRPD 1142
              ***************************************************9999****************************999*************** PP

    GRAS  200 EalaVnlvlqlhrlldesvsles.erdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvva 299 
              E+laVn  l+l++l de++s e+ +rd+vLkl+++++P+v++++  + + n++ F  rf ea+++ysalfd+++++lpr+++eri  Ere+ gre++nv+a
  901864 1143 EVLAVNAGLRLKNLQDETGSEENcPRDAVLKLIRNMNPDVFIHTVVNGSFNAPFFISRFKEAVYHYSALFDMFDSTLPRDNKERIRFEREFYGREAMNVIA 1243
              ***********************99**************************************************************************** PP

    GRAS  300 cegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvk.sdgyrveeesgslvlgWkdrpLvsvSaW 373 
              ce+a+r+er et+++W+ r+ +aGF++ p++ + ++  +  l+k    + + v+e+s++l++gWk+r+L++ S+W
  901864 1244 CEEADRVERPETYRQWQVRMVRAGFRQKPIKPELVELFREKLKKWRyHKDFVVDENSKWLLQGWKGRTLYASSCW 1318
              *******************************************9998999************************* PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098564.811295674IPR005202Transcription factor GRAS
PfamPF035141.2E-109321694IPR005202Transcription factor GRAS
PROSITE profilePS5098560.6639141298IPR005202Transcription factor GRAS
PfamPF035143.6E-1049411318IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0042802Molecular Functionidentical protein binding
Sequence ? help Back to Top
Protein Sequence    Length: 1322 aa     Download sequence    Send to blast
MGSYSAGFPG SLDWFDFPGL GNGFYLNDQP LLETGSVPPP PEPYSQQSLA SADADFSDSV  60
LKYISQVLME EDMEDKPCMF HDALSLQAAE KSLYEALGEK YAVDDSDQPL TTTSLAQLVS  120
SPDGSSYASS ITTTSSDSQW SFDCLENNRP SSWLQTPIPS NFVFQSTSTR TSPQSVVGSG  180
NAVFGSSFSG DLASNMFNDS ELALQFKKGM EEASKFLPKS SQLDNSVPYR LTGKKSHWRE  240
DEHLAEERSR KQSAVYVDET DELTEMFDKI LIFGEAKEQP VCILNENFPK EPAKASSFGK  300
SHKSEKPDAS GNSYTKETPD LRTMLVSCAQ AVSINDRRTA DDLLSQIRQH SSSYGDGTER  360
LAHYFANSLE ARLAGIGTQV YTALSSKKTS TSDMLKAYQT YISVCPFKKI AIIFANHSIM  420
RLASTANAKT IHIIDFGISY GFQWPSLIHR LAWRRGSSCK LRITGIELPQ RGFRPAEGVI  480
ETGHRLAKYC QKFNVPFEYN AIAQKWETIK LEDLKLKEGE FVAVNSLFRF RNLLDETVAV  540
HSPRDTVLKL IRKIKPDVFI PGILSGSYNA PFFVTRFREV LFHYSSLFDM CDTNLTREDP  600
MRVMFEKEFY GREIMNVVAC EGTERVERPE SYKQWQARAM RAGFRQIPLD KELVQKLKLL  660
VESGYKTKEF DVDQDCHWLL QGWKGRIVYG SSVWVPLFIQ TEYFDGNPNF LTDPMEDQYP  720
PPSDTLLKYV SEILMEESNG DYKQSMFYDS LALRKTEEML QQVITDSQNQ SFSPDSMITN  780
SWDASGSIES AYSADLQIGL PVDEFMVKSV FSDAESALQF KKGVEEASKF LPNSDQWVIN  840
LDIERPERRG LVKEEMGLDQ LRIKKNHERE IILDFEEVRS SKQFASNIED GKITEMFDKV  900
LLLDGECDPP TLLDSEIQAI RSSKNRGGKG KKKKCQVVDF RTLLTHCAQA ISTGDKTTAL  960
DFLLQIRQQS SPLGDAGQRL AHCFANALEA RLQGSTGPMI QNYYNAITTS LKDTAADTLK  1020
AYRVYLSSSP FVTLMYFFSI RMILEVAKDA PVLHIVDFGI LYGFQWPMFI QYISGRNDVP  1080
RKLRITGIEL PQCGFRPAER IEETGRRLAE YCKRFNVPFE YKAIASQNWE TIGIEDLDIR  1140
PDEVLAVNAG LRLKNLQDET GSEENCPRDA VLKLIRNMNP DVFIHTVVNG SFNAPFFISR  1200
FKEAVYHYSA LFDMFDSTLP RDNKERIRFE REFYGREAMN VIACEEADRV ERPETYRQWQ  1260
VRMVRAGFRQ KPIKPELVEL FREKLKKWRY HKDFVVDENS KWLLQGWKGR TLYASSCWVP  1320
A*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5hyz_A4e-4732513188374GRAS family transcription factor containing protein, expressed
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMap901864
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0053150.0AC005315.3 Arabidopsis thaliana chromosome 2 clone T9I4 map mi54, complete sequence.
GenBankCP0026850.0CP002685.1 Arabidopsis thaliana chromosome 2, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_023639734.10.0scarecrow-like protein 33 isoform X4
SwissprotP0C8830.0SCL33_ARATH; Scarecrow-like protein 33
TrEMBLD7LKB70.0D7LKB7_ARALL; Scarecrow transcription factor family protein
STRINGscaffold_401365.10.0(Arabidopsis lyrata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM35827189
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G29060.10.0GRAS family protein