PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID AA32G00790
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Aethionemeae; Aethionema
Family GRAS
Protein Properties Length: 1451aa    MW: 163978 Da    PI: 5.6515
Description GRAS family protein
Gene Model
Gene Model ID | Type | Source | Coding Sequence
AA32G00790 | genome | VEGI | View CDS
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | GRAS | 359.5 | 4.9e-110 | 376 | 749 | 1 | 373
        GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshltaN 99 
                 l++lL++cA+avs +d   a++l+ +++ ++sp g+  +Rla+yf++ L+arl+++++++y al++++ts   +++ l+a+++f  v+P+ k++ + aN
  AA32G00790 376 LRTLLVSCAQAVSVDDRITAHELIRQVRLQSSPLGNGAERLAHYFANSLEARLSGTGTQIYTALSSKRTS---TADMLKAYHTFISVCPFKKIAIIFAN 471
                 6789************************************************************999999...89************************ PP

        GRAS 100 qaIle..avegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledleleeL 194
                 + ++    +++++++HiiDf+is+G+QWpaL++ La Rp+gpp+lRiTg++ p+ g   ++ + etgerLa+++++++vpfe+n+ +a+++e+++le+L
  AA32G00790 472 HSLMHlaVAKNAKTIHIIDFGISYGFQWPALIHRLAWRPGGPPKLRITGIDLPQRGfrPAQGVIETGERLARYCQRFNVPFEYNA-IAQKWETIKLEDL 569
                 **98721344559*****************************************99*****************************.7************ PP

        GRAS 195 rvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgre 293
                 ++++gE +aVn+ ++l +llde+v+++ +rd+vLkl+++++P++++ +    ++n++ F++rf e l ++s+lfd+ +++l+re++ r + E+e+ gre
  AA32G00790 570 KIREGEFIAVNSLFRLRNLLDETVAVNCPRDAVLKLIRKVNPDIFIPAILSGSYNAPFFVTRFRELLFHFSSLFDMCDTNLSREDDMRLMFEKEFYGRE 668
                 *************************************************************************************************** PP

        GRAS 294 ivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvk.sdgyrveeesgslvlgWkdrpLvsvSaW 373
                 i+nvvaceg+er+er e++++W++r ++aGF+p+pl+ k +++ k+ ++  + s+ + ++ + ++l++gW++r +++ +aW
  AA32G00790 669 IMNVVACEGSERVERPESYKQWQARAMRAGFRPLPLDLKLVQKLKMTVESGYnSKDFDIDRDGNWLLQGWRGRIVYASCAW 749
                 ************************************************99888999************************* PP

2 | GRAS | 360.7 | 2.1e-110 | 1074 | 1448 | 4 | 373
        GRAS    4 lLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlar.svselykalppsetseknsseelaalklfsevsPilkfshltaN 99  
                  lL+ cA+ +s+gd++ a+ +L ++++++sp gd+ qRla+yf++AL+arl++ +++ + +  ++ ++ +k+ +++l+a++++   sP++++ ++   
  AA32G00790 1074 LLTLCAQSISTGDKTTAHDVLLQIKQQSSPLGDASQRLAHYFANALEARLQGsTGPVIQNYYNALTSMKKTAADTLKAYQVYLSSSPFMTLMYFFSS 1170
                  7999************************************************4555566666888888888************************** PP

        GRAS  100 qaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledleleeL 194 
                   +Il+a+  ++ +HiiDf+i +G+QWp ++q L++R++ p++lRiTg++ p+ g    e++eetg+rLa+++++++vpfe+n++v++++e++++e+L
  AA32G00790 1171 GMILDAAGDASVLHIIDFGILYGFQWPMFIQHLSNRKGVPHKLRITGIDLPQRGfrPGEKIEETGRRLAEYCKRFNVPFEYNAIVSQNWETIRIEDL 1267
                  ****************************************************99*9***************************************** PP

        GRAS  195 rvkpgEalaVnlvlqlhrlldesvsles.erdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErell 290 
                  +++p+E+laVn  ++l++llde++  ++ +rd++Lkl+++++P+v+v++  + + n++ F  rf eal +ysalfd+++a+lpr++ er+  Ere+ 
  AA32G00790 1268 KIQPNEVLAVNAGIRLKNLLDETGGEDNcPRDALLKLIRNINPDVFVHTILNGSFNAPFFISRFKEALFHYSALFDMFDATLPRDNPERVRFEREFY 1364
                  ************************999999******************************************************************* PP

        GRAS  291 greivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvk.sdgyrveeesgslvlgWkdrpLvsvSaW 373 
                  gre++nv+ace+++r+er et+++W+ r+ +aGFk+ p++ + +k+ k  l++    + + v+e+s++l++gWk+r L++ S+W
  AA32G00790 1365 GRELMNVIACEESDRVERPETYRQWQVRMIRAGFKQKPVKAELVKSFKEKLKRWRyHKDFVVDENSKWLLQGWKGRILFASSCW 1448
                  ****************************************************9998999************************* PP

Protein Features
Database | Entry ID | E-value | Start | End | InterPro ID | Description
PROSITE profile | PS50985 | 66.925 | 350 | 729 | IPR005202 | Transcription factor GRAS
Pfam | PF03514 | 1.7E-107 | 376 | 749 | IPR005202 | Transcription factor GRAS
PROSITE profile | PS50985 | 63.299 | 1045 | 1428 | IPR005202 | Transcription factor GRAS
Pfam | PF03514 | 7.2E-108 | 1074 | 1448 | IPR005202 | Transcription factor GRAS
Gene Ontology
GO Term | GO Category | GO Description
GO:0042802 | Molecular Function | identical protein binding
Sequence
Protein Sequence    Length: 1451 aa
MGSYSFRSEF PGSLEEIDFN NLSNFPVSNE ALGLGNGFYL DDQPLLDLAS LNIPSALPEP  60
YSQNSVPADS SSFSDDPDFS DSVLKYISQV LMEEDMAEKP CMFHDALSLQ AAEKSLYEAL  120
GEKYPYSPDQ QPHTSPGQWD YSPDVSFSSD YGNTSNITTS SDSHWSLDGL ENRPSWLQTP  180
IPNNFVFHSS SRSTPRGLNR DTGGGNNGVL SSGFSDDLVS SMFNDSDLAI QFRRGVEEAS  240
KFLPKSSQLL INPASGSKEK GSKEGEREHE SVPYRLTGKK SHWREDEHLA EERSNKQSAV  300
YVEENELSEM FDKVLLFGRP KEQPVCILHE TFPKDPTKFS SFKTQKGETT QPSETKNNGK  360
KLAASGNNSK QDTADLRTLL VSCAQAVSVD DRITAHELIR QVRLQSSPLG NGAERLAHYF  420
ANSLEARLSG TGTQIYTALS SKRTSTADML KAYHTFISVC PFKKIAIIFA NHSLMHLAVA  480
KNAKTIHIID FGISYGFQWP ALIHRLAWRP GGPPKLRITG IDLPQRGFRP AQGVIETGER  540
LARYCQRFNV PFEYNAIAQK WETIKLEDLK IREGEFIAVN SLFRLRNLLD ETVAVNCPRD  600
AVLKLIRKVN PDIFIPAILS GSYNAPFFVT RFRELLFHFS SLFDMCDTNL SREDDMRLMF  660
EKEFYGREIM NVVACEGSER VERPESYKQW QARAMRAGFR PLPLDLKLVQ KLKMTVESGY  720
NSKDFDIDRD GNWLLQGWRG RIVYASCAWG FNLMMERSYS GALGMFNAFE YYDVNVLPNP  780
DLYTDLEFGI LSSSDFDRNP NICADLPSAW EPIDDSTDAL LEYVSQILME ESIGDNQSMF  840
YDSLALQKTE EMLQQVITDS QTHSFSPDSV ITDEFSRTSS DSMAASVSIE SSSSSNTVLV  900
STPLSDSGFD STMDSFHNRQ IETPVNEALV KSMFSDTESA QQFKKGVEEA KKFLPNNNQW  960
LINLQPEKNK RPSEKFSVKE EKGLDGLSES SRVRKNHDWE VLDSEETRNS KQTAGNVEDG  1020
NITEMFDKVL LLDNECDPQT VINSKNGSSK TQAVQTKKGV GKKKKNQVVD FGALLTLCAQ  1080
SISTGDKTTA HDVLLQIKQQ SSPLGDASQR LAHYFANALE ARLQGSTGPV IQNYYNALTS  1140
MKKTAADTLK AYQVYLSSSP FMTLMYFFSS GMILDAAGDA SVLHIIDFGI LYGFQWPMFI  1200
QHLSNRKGVP HKLRITGIDL PQRGFRPGEK IEETGRRLAE YCKRFNVPFE YNAIVSQNWE  1260
TIRIEDLKIQ PNEVLAVNAG IRLKNLLDET GGEDNCPRDA LLKLIRNINP DVFVHTILNG  1320
SFNAPFFISR FKEALFHYSA LFDMFDATLP RDNPERVRFE REFYGRELMN VIACEESDRV  1380
ERPETYRQWQ VRMIRAGFKQ KPVKAELVKS FKEKLKRWRY HKDFVVDENS KWLLQGWKGR  1440
ILFASSCWVP A
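The listing above groups residues in blocks of ten and ends each full line with a running position counter. A minimal sketch of how such a listing could be joined into one continuous string and sliced by the Start/End coordinates reported in this record, which the alignments show to be 1-based and inclusive (residue 376 is the L of LRTLL). The function names `parse_sequence_block` and `domain_slice` are hypothetical helpers, not part of any PlantTFDB tool, and only the first two lines of the sequence are used as sample input:

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a grouped protein listing into one continuous sequence."""
    parts = []
    for line in text.splitlines():
        # Drop the trailing running position counter (e.g. "  60"), if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces between the ten-residue groups.
        parts.append("".join(line.split()))
    return "".join(parts)


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating coordinates as 1-based inclusive."""
    return seq[start - 1:end]


# Sample input: the first two lines of the listing above.
block = """MGSYSFRSEF PGSLEEIDFN NLSNFPVSNE ALGLGNGFYL DDQPLLDLAS LNIPSALPEP  60
YSQNSVPADS SSFSDDPDFS DSVLKYISQV LMEEDMAEKP CMFHDALSLQ AAEKSLYEAL  120"""

seq = parse_sequence_block(block)
print(len(seq))
print(domain_slice(seq, 1, 10))
```

Applied to the full listing, the same parser should yield a string whose length can be checked against the stated 1451 aa, and `domain_slice(seq, 376, 749)` would return the first GRAS domain region.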
3D Structure
Structure
PDB ID | Evalue | Query Start | Query End | Hit Start | Hit End | Description
5hyz_A | 5e-48 | 380 | 1448 | 8 | 374 | GRAS family transcription factor containing protein, expressed
Functional Description
Source | Description
UniProt | Probable transcription factor involved in plant development. {ECO:0000250}.
Cis-element
Source | Link
PlantRegMap | AA32G00790
Regulation -- PlantRegMap
Source | Upstream Regulator | Target Gene
PlantRegMap | Retrieve | -
Annotation -- Nucleotide
Source | Hit ID | E-value | Description
GenBank | AC005315 | 3e-40 | AC005315.3 Arabidopsis thaliana chromosome 2 clone T9I4 map mi54, complete sequence.
GenBank | CP002685 | 3e-40 | CP002685.1 Arabidopsis thaliana chromosome 2, complete sequence.
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_023639734.1 | 0.0 | scarecrow-like protein 33 isoform X4
Swissprot | P0C883 | 0.0 | SCL33_ARATH; Scarecrow-like protein 33
TrEMBL | D7LKB7 | 0.0 | D7LKB7_ARALL; Scarecrow transcription factor family protein
STRING | scaffold_401365.1 | 0.0 | (Arabidopsis lyrata)
Orthologous Group
Lineage | Orthologous Group ID | Taxa Number | Gene Number
MalvidsOGEM35827189
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT2G29060.1 | 0.0 | GRAS family protein