PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Neem_5027_f_1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family GRAS
Protein Properties Length: 502aa    MW: 54714.8 Da    PI: 4.9885
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Neem_5027_f_1genomeNGDView Nucleic Acid
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS361.71.1e-1101335022374
           GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshlt 97 
                    +++++e+A+avs+g+   + ++L+rls+ ++++g++ qRl  y+  AL++r++     + +  p+ e      +e++ +++l++++sP++k+++++
  Neem_5027_f_1 133 KQTVIEAATAVSEGKYDVTAEILTRLSQACNSKGNSEQRLMEYMCSALKSRVNP----VENPPPVAEL---FGVEHVGSTQLLYDLSPCFKLAFMA 221
                    57899************************************************9....4433333333...49*********************** PP

           GRAS  98 aNqaIleavege...ervHiiDfdisqGlQWpaLlqaLasRpegpp.slRiTgvgspesgskeeleetgerLakfAeelgvpfefnvlvakrledl 189
                    aN aIlea+  +   +++H+iDfdi+qG+Q+++Ll+aL++R++g+p  ++iT+v++ ++g +e+ +++g+ L+++Ae++gv ++fnv v+ +l dl
  Neem_5027_f_1 222 ANLAILEATLDQttsNKIHVIDFDIGQGGQYMNLLHALSARQNGKPfIVKITAVAD-NTGCEEKFKAVGDMLSQVAERVGVCLHFNVRVSPKLGDL 316
                    ********97776779**************************9999899*******.778***************************7777***** PP

           GRAS 190 eleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikv 285
                    + ++L ++p+E laVn++++l+r++desvs+e++rde+L+ vk+l P+vv++veqe+++n+++Fl r+ ea+ yy alfds+e+ +pr++ er kv
  Neem_5027_f_1 317 SRDSLGCEPDEPLAVNFAFKLFRMPDESVSTENPRDELLRRVKGLTPRVVTLVEQEMNTNTAPFLGRVNEACGYYGALFDSMESFVPRDNVERFKV 412
                    ************************************************************************************************ PP

           GRAS 286 ErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvk..sdgyrveeesgslvlgWkdrpLvsvSaWr 374
                    E+  lgr++ n vaceg++r+er+e ++kWr+r+++aGF+  p+s+++a++ ++ l+  +  + g++v+ee+g +++gW +rpL+++SaWr
  Neem_5027_f_1 413 EAG-LGRKLANSVACEGRDRVERCEVFGKWRARMSMAGFELKPMSQNIAESLRTRLSSGSrvNPGFTVKEENGGVCFGWLGRPLTVASAWR 502
                    ***.*************************************************99988776789**************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098547.639106480IPR005202Transcription factor GRAS
PfamPF035143.7E-108133502IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0005737Cellular Componentcytoplasm
Sequence ? help Back to Top
Protein Sequence    Length: 502 aa     Download sequence    Send to blast
MSSGFPGGGG IPDLYSSLAG RSIAMNDNPS QPSYRSSQLP GIFLDPTTVQ AQDSEKKMLN  60
RLQELEKQLL DDNDEEEGDA VSVITNTNSE WSETIQNLIT FSPKQSVTPI SPSPTSSSSS  120
SSSVASPVSS CSKQTVIEAA TAVSEGKYDV TAEILTRLSQ ACNSKGNSEQ RLMEYMCSAL  180
KSRVNPVENP PPVAELFGVE HVGSTQLLYD LSPCFKLAFM AANLAILEAT LDQTTSNKIH  240
VIDFDIGQGG QYMNLLHALS ARQNGKPFIV KITAVADNTG CEEKFKAVGD MLSQVAERVG  300
VCLHFNVRVS PKLGDLSRDS LGCEPDEPLA VNFAFKLFRM PDESVSTENP RDELLRRVKG  360
LTPRVVTLVE QEMNTNTAPF LGRVNEACGY YGALFDSMES FVPRDNVERF KVEAGLGRKL  420
ANSVACEGRD RVERCEVFGK WRARMSMAGF ELKPMSQNIA ESLRTRLSSG SRVNPGFTVK  480
EENGGVCFGW LGRPLTVASA WR
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5hyz_A3e-441215021375GRAS family transcription factor containing protein, expressed
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006490660.10.0scarecrow-like protein 8
SwissprotQ9FYR70.0SCL8_ARATH; Scarecrow-like protein 8
TrEMBLA0A067ESB80.0A0A067ESB8_CITSI; Uncharacterized protein
TrEMBLA0A2H5NZY60.0A0A2H5NZY6_CITUN; Uncharacterized protein
STRINGXP_006490660.10.0(Citrus sinensis)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM43502856
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G52510.11e-179SCARECROW-like 8