PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Neem_4272_f_1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family GRAS
Protein Properties Length: 454 aa    MW: 51052.5 Da    pI: 5.7492
Description GRAS family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
Neem_4272_f_1  genome  NGD     View Nucleic Acid
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GRAS    300.8  3.5e-92  52     451  2          373
           GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetsekn...sseelaalkl..fsevsPilk 92 
                    ++lL++cA+a++s+d++l q++L+ l+++a pdgd++qRl+  f++AL +r a+  s+++k l++ ++++ n     +++++ +l  f++++P+++
  Neem_4272_f_1  52 EQLLVHCANAIESNDATLSQQILWVLNNIAPPDGDSNQRLTCAFLRALITRAAK--SGTCKMLAAMANAHCNlviDNHKFSVIELasFVDLTPWHR 145
                    79****************************************************..8888888777777666544333334444477********* PP

           GRAS  93 fshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg...skeeleetgerLakfAeelgvpfefnvl.... 181
                    f++ +aN aIleaveg + +Hi+D+++++++Q p+L++a+a+R egpp+l++T  g+ e        ++ee+g++L +fA++ +v +ef+v+    
  Neem_4272_f_1 146 FGFTAANAAILEAVEGYSVIHIVDLSLTHCMQIPTLIDAIANRFEGPPRLKLTIAGATEDIppmLDLSYEELGSKLVNFARSRNVMMEFRVIpssy 241
                    *****************************************************9999776698888999***********************9984 PP

           GRAS 182 ...vakrledleleeLrvkp.gEalaVnlvlqlhrll................desvsleserdevLklvkslsPkvvvvveqeadhnsesFlerf 257
                        ++ +e++++++L   + gEal++n++++lh+++                 es +++s r+ +Lk+++sl+P++v +++++ad++s++++ r+
  Neem_4272_f_1 242 adgFSSLIEQIRVQNLVYAEsGEALVINCHMMLHYIPeetvppipnansnpyfVESSCTSSIRTMFLKALRSLEPTIVLLIDEDADLTSNDLVCRL 337
                    444444445555555555559********************************777777778********************************** PP

           GRAS 258 lealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrvee 353
                     +a++y +  +d++++ lpr s++r ++E+  + ++i+nv+a+eg +r+er e +++W +r+++a F+ ++++e+a++++k++l++++  g+ +++
  Neem_4272_f_1 338 RSAFNYLWIPYDTVDTFLPRGSKQREWYEAD-ICWKIENVIAHEGLQRVERPELKSRWVQRMRNANFRGIAFGEDAVSEVKTMLEEHA-AGWGLKK 431
                    *******************************.********************************************************.8999*** PP

           GRAS 354 esgslvlgWkdrpLvsvSaW 373
                    e++ lvl+Wk+++ v+++aW
  Neem_4272_f_1 432 EEDDLVLTWKGHNAVFATAW 451
                    ******************** PP

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
PROSITE profile  PS50985   43.724   25     433  IPR005202    Transcription factor GRAS
Pfam             PF03514   1.2E-89  52     451  IPR005202    Transcription factor GRAS
Sequence
Protein Sequence    Length: 454 aa
MMQFTETPPQ PLHQIASFSN LTMNKNPIHR TRPWPRFPSK SLGSFGDANC MEQLLVHCAN  60
AIESNDATLS QQILWVLNNI APPDGDSNQR LTCAFLRALI TRAAKSGTCK MLAAMANAHC  120
NLVIDNHKFS VIELASFVDL TPWHRFGFTA ANAAILEAVE GYSVIHIVDL SLTHCMQIPT  180
LIDAIANRFE GPPRLKLTIA GATEDIPPML DLSYEELGSK LVNFARSRNV MMEFRVIPSS  240
YADGFSSLIE QIRVQNLVYA ESGEALVINC HMMLHYIPEE TVPPIPNANS NPYFVESSCT  300
SSIRTMFLKA LRSLEPTIVL LIDEDADLTS NDLVCRLRSA FNYLWIPYDT VDTFLPRGSK  360
QREWYEADIC WKIENVIAHE GLQRVERPEL KSRWVQRMRN ANFRGIAFGE DAVSEVKTML  420
EEHAAGWGLK KEEDDLVLTW KGHNAVFATA WVPA
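The sequence block above is printed in groups of ten residues with a running position counter at the end of each full line. As a minimal sketch (the helper names `clean_sequence` and `subsequence` are hypothetical, not part of any PlantTFDB tooling), the following Python strips that formatting and slices out the GRAS domain using the 1-based, inclusive coordinates 52-451 reported in the Signature Domain table:

```python
import re

# Sequence block copied verbatim from this record.
# The trailing integers are position counters, not residues.
RAW_BLOCK = """
MMQFTETPPQ PLHQIASFSN LTMNKNPIHR TRPWPRFPSK SLGSFGDANC MEQLLVHCAN  60
AIESNDATLS QQILWVLNNI APPDGDSNQR LTCAFLRALI TRAAKSGTCK MLAAMANAHC  120
NLVIDNHKFS VIELASFVDL TPWHRFGFTA ANAAILEAVE GYSVIHIVDL SLTHCMQIPT  180
LIDAIANRFE GPPRLKLTIA GATEDIPPML DLSYEELGSK LVNFARSRNV MMEFRVIPSS  240
YADGFSSLIE QIRVQNLVYA ESGEALVINC HMMLHYIPEE TVPPIPNANS NPYFVESSCT  300
SSIRTMFLKA LRSLEPTIVL LIDEDADLTS NDLVCRLRSA FNYLWIPYDT VDTFLPRGSK  360
QREWYEADIC WKIENVIAHE GLQRVERPEL KSRWVQRMRN ANFRGIAFGE DAVSEVKTML  420
EEHAAGWGLK KEEDDLVLTW KGHNAVFATA WVPA
"""


def clean_sequence(block: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


sequence = clean_sequence(RAW_BLOCK)
# Domain coordinates taken from the Signature Domain table (Start 52, End 451).
gras_domain = subsequence(sequence, 52, 451)

print(len(sequence))      # 454, matching the stated protein length
print(len(gras_domain))   # 400
print(gras_domain[:9])    # EQLLVHCAN, the first residues of the aligned region
```

The same `subsequence` helper can be applied to the other coordinate pairs in this record, provided they are also read as 1-based and inclusive.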
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3g_B  1e-46   46           453        82         474      Protein SHORT-ROOT
5b3h_B  5e-47   47           453        29         420      Protein SHORT-ROOT
5b3h_E  5e-47   47           453        29         420      Protein SHORT-ROOT
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_021274418.1  0.0      scarecrow-like protein 32 isoform X1
Swissprot  Q9SN22          0.0      SCL32_ARATH; Scarecrow-like protein 32
TrEMBL     A0A061DGZ9      0.0      A0A061DGZ9_THECC; SCL domain class transcription factor
STRING     EOX91640        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM5445              27           50
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G49950.1  0.0      GRAS family protein