PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thhalv10012936m
Common NameEUTSA_v10012936mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema
Family GRAS
Protein Properties Length: 645aa    MW: 69382.7 Da    PI: 4.9827
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thhalv10012936mgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS3703.2e-1132646442374
             GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfsh 95 
                      +++++e+A+a+++g++e a+++Lar+s++ +p++++ ++l+ +++ AL++r++       + + ps+++e   +e+l +++l++e+sP++k+++
  Thhalv10012936m 264 RQTVMEIATAIAEGKTEIATEILARVSQTPNPRRNSEEKLVDFMVTALRSRINP------AEI-PSPATELYGKEHLISTQLLYELSPCFKLGF 350
                      6889*************************************************9......333.4444446799******************** PP

             GRAS  96 ltaNqaIleavege.....ervHiiDfdisqGlQWpaLlqaLasRp..egppslRiTgvgspesg.......skeeleetgerLakfAeelgvp 175
                      ++aN aIl+a+ ++      ++H+iDfdi++G+Q+++Ll++L++R+  ++pp ++iT+v +++ g        +++l ++g+ L+++ ++lg++
  Thhalv10012936m 351 MAANLAILDAAGNNddgegMTMHVIDFDIGEGGQYVNLLHELSTRRngNNPPVVKITAVTNSSDGfsvvadgGEDRLTAVGDLLSQLGDRLGIS 444
                      ***********99988877789*********************99977789**********888799***999********************* PP

             GRAS 176 fefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfd 269
                      ++fnv++++rl+dl+ e+L ++p+E laVnl+++l+r++desv++e++rde+L+ vk+l+P+vv++veqe++ n+++Fl r++e++ +y al+d
  Thhalv10012936m 445 LRFNVVASQRLSDLSRESLGCDPDEHLAVNLAFKLYRVPDESVCTENPRDELLRRVKGLKPRVVTIVEQEMNSNTAPFLGRVSESCACYGALLD 538
                      *****9999************************************************************************************* PP

             GRAS 270 sleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvk..sdgyrveeesgslvlg 361
                      s+e+++p+ +++r+kvE+  +gr++ n+vaceg er+er+e ++kWr+r+++aGF+ +plsek+a+++k+ l++ +    g++v+e++g +++g
  Thhalv10012936m 539 SVESTVPSVNSDRVKVEEG-IGRKLINAVACEGIERIERCEVFGKWRMRMSMAGFELMPLSEKIAESMKSRLSNGNrvHPGFTVKEDNGGVCFG 631
                      *******************.**************************************************9998777789************** PP

             GRAS 362 WkdrpLvsvSaWr 374
                      W++r+L ++SaWr
  Thhalv10012936m 632 WMGRTLAVASAWR 644
                      ************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098547.461237622IPR005202Transcription factor GRAS
PfamPF035141.1E-110264644IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0005737Cellular Componentcytoplasm
Sequence ? help Back to Top
Protein Sequence    Length: 645 aa     Download sequence    Send to blast
MASGFSGGGG GGPEFFGVGG RSMPGGPGTV INAGNNNPQP STYRNQIPGI FLDQIGNRVS  60
GSHGIAGKRT LADFQAAQQQ LHQQQSFHNQ AAINAFLLRS VKPRTFQNLQ SPTPTTIDLT  120
SVNDMSLFGG SSQRYGLPLL PNLRSQQQQL FGGVRMGFGS GNQTLSGVPC IEPVQNLNRV  180
EESDNMLNSL RELEKQLLDD DDDESDAKGG DDDVSVITHS NSDWIQSLVA PNPNPNPVSS  240
SSPSSSSSSS SPSTASTTTQ VCSRQTVMEI ATAIAEGKTE IATEILARVS QTPNPRRNSE  300
EKLVDFMVTA LRSRINPAEI PSPATELYGK EHLISTQLLY ELSPCFKLGF MAANLAILDA  360
AGNNDDGEGM TMHVIDFDIG EGGQYVNLLH ELSTRRNGNN PPVVKITAVT NSSDGFSVVA  420
DGGEDRLTAV GDLLSQLGDR LGISLRFNVV ASQRLSDLSR ESLGCDPDEH LAVNLAFKLY  480
RVPDESVCTE NPRDELLRRV KGLKPRVVTI VEQEMNSNTA PFLGRVSESC ACYGALLDSV  540
ESTVPSVNSD RVKVEEGIGR KLINAVACEG IERIERCEVF GKWRMRMSMA GFELMPLSEK  600
IAESMKSRLS NGNRVHPGFT VKEDNGGVCF GWMGRTLAVA SAWR*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3h_B1e-322376442419Protein SHORT-ROOT
5b3h_E1e-322376442419Protein SHORT-ROOT
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapThhalv10012936m
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006401816.10.0scarecrow-like protein 8
SwissprotQ9FYR70.0SCL8_ARATH; Scarecrow-like protein 8
TrEMBLV4LHQ50.0V4LHQ5_EUTSA; Uncharacterized protein
STRINGXP_006401816.10.0(Eutrema salsugineum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM43502856
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G52510.10.0SCARECROW-like 8