PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thhalv10023324m
Common NameEUTSA_v10023324mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema
Family GRAS
Protein Properties Length: 671aa    MW: 74848.9 Da    PI: 7.2224
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thhalv10023324mgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS427.41.1e-1302836591373
             GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdg.dpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkf 93 
                      lv+l++ c ea+++++ +++++++ar  +lasp+g +pm+Rl++y+ eALa r+ar++++++++ pp+e +++ ++e+ +al+++++v+Pi+kf
  Thhalv10023324m 283 LVNLVTGCLEAIRTRNIAAINHFIARTGDLASPRGrTPMTRLISYYIEALALRVARMWPHIFHIAPPREFERTVEDESGNALRFLNQVTPIPKF 376
                      578999*****************************99***********************************99******************** PP

             GRAS  94 shltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfefnvlvakrle 187
                       h+taN+++l+a+eg+ervHiiDfdi+qGlQWp+++q+LasR+++p ++RiTg+g+    sk el+etg+rL+ fAe+++++fef++ v++rle
  Thhalv10023324m 377 IHFTANEMLLRAFEGKERVHIIDFDIKQGLQWPSFFQSLASRSNPPQHVRITGIGE----SKLELNETGDRLHGFAEAMNLQFEFHP-VVDRLE 465
                      ********************************************************....***************************.7***** PP

             GRAS 188 dleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpresee 281
                      d++l++L+vk+gE++aVn+vlq+h++l + +  +   ++++ l++s++P  +v++eqea+hns ++ +r++++l+yysa+fd+++++l+++s  
  Thhalv10023324m 466 DVRLWMLHVKEGESVAVNCVLQMHKTLYDGTGGAI--RDFVGLIRSTNPVALVIAEQEAEHNSMQLETRVCNSLKYYSAIFDAIHTNLGTDSLI 557
                      ***************************77666655..89******************************************************* PP

             GRAS 282 rikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvk..sdg.yrveee.......sgslvlgWkdr 365
                      r+k+E++l+grei+n+vaceg++r+erh  +++Wr+++e+ GF+++ +se+++ q k+llr+++  ++g ++ve++        g ++l W+d+
  Thhalv10023324m 558 RVKIEEMLFGREIRNIVACEGSHRQERHVGFRHWRRMMEQLGFRSLGVSEREVLQSKMLLRMYGsgNEGfFNVEQSdedgvggGGGVTLRWSDQ 651
                      ***************************************************************9856669999977777898778888****** PP

             GRAS 366 pLvsvSaW 373
                      pL+++SaW
  Thhalv10023324m 652 PLYTISAW 659
                      ******** PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098557.36257630IPR005202Transcription factor GRAS
PfamPF035143.8E-128283659IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
Sequence ? help Back to Top
Protein Sequence    Length: 671 aa     Download sequence    Send to blast
MLAGCSSSSL LSPTRRLRSE AVAATSAAVA VHFPMNTQRL DLPCSSSFSR KETPSNRPLG  60
RSISLDNNNN NSNKPIERKT GGCSLKQSIK LPPLATTRGS GDGFSWNNNN NNRGKKSLKR  120
LAEEDKEDES CLSRVKRQRG ETEDDHSRSF WFEHFTAQNT SPGLPFTLTC SGDDEEKVCF  180
APSEVISQPL PSNPNWVNSS VITELAGLGD KDIESSRPAA VKEASGSSTS ASSESHSLRH  240
RVQEPTNGSR NPYSHRGNAE ERTSENINNN NNHRNDLQRD FELVNLVTGC LEAIRTRNIA  300
AINHFIARTG DLASPRGRTP MTRLISYYIE ALALRVARMW PHIFHIAPPR EFERTVEDES  360
GNALRFLNQV TPIPKFIHFT ANEMLLRAFE GKERVHIIDF DIKQGLQWPS FFQSLASRSN  420
PPQHVRITGI GESKLELNET GDRLHGFAEA MNLQFEFHPV VDRLEDVRLW MLHVKEGESV  480
AVNCVLQMHK TLYDGTGGAI RDFVGLIRST NPVALVIAEQ EAEHNSMQLE TRVCNSLKYY  540
SAIFDAIHTN LGTDSLIRVK IEEMLFGREI RNIVACEGSH RQERHVGFRH WRRMMEQLGF  600
RSLGVSEREV LQSKMLLRMY GSGNEGFFNV EQSDEDGVGG GGGVTLRWSD QPLYTISAWA  660
IAGIGGSSSF *
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A1e-6329065926378Protein SCARECROW
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapThhalv10023324m
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006391781.10.0scarecrow-like protein 28
SwissprotQ9CAN30.0SCL28_ARATH; Scarecrow-like protein 28
TrEMBLV4KGW40.0V4KGW4_EUTSA; Uncharacterized protein
STRINGXP_006391781.10.0(Eutrema salsugineum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM72962743
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G63100.10.0GRAS family protein