PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_A02G0703
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 616aa    MW: 68558.8 Da    PI: 5.0291
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_A02G0703genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS445.53.7e-1362486153373
         GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshltaNq 100
                  + L+ecA  +s+g+ e+a+a++++l++ +s +gdp qR+aay++e+Laar+a s++ lykal+++e +   ss++laa+++++ev+P++kf++++aN 
  Gh_A02G0703 248 QMLIECAAILSEGHIEKASAIINELRQKVSIQGDPPQRIAAYMVEGLAARMAASGKYLYKALRCKEPP---SSDRLAAMQILFEVCPCFKFGFMAANG 342
                  79*****************************************************************9...9************************** PP

         GRAS 101 aIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledleleeLrv 196
                  aI ea++ge+rvHiiDfdisqG Q+++L+q++a+ p++pp+lR+Tgv++pes+   +  le +g rL+k+Ae lgvpfef++ v +r++ + +++L++
  Gh_A02G0703 343 AIIEAFKGEKRVHIIDFDISQGSQYITLIQTIAKLPGKPPHLRLTGVDDPESVqrLNGGLEIVGLRLEKLAEILGVPFEFRA-VPSRTSLVAPSMLDC 439
                  ***************************************************99777889***********************.7************** PP

         GRAS 197 kpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgrei 294
                  kpgEal+Vn+++qlh+++desvs+ ++rd++L++vks++Pk+v+vveq++++n+++F+ rf+ea +yysa+fdsl+a+lpres++r++vEr++l+r+i
  Gh_A02G0703 440 KPGEALIVNFAFQLHHMPDESVSTINQRDQLLRMVKSMNPKLVTVVEQDVNTNTSPFFPRFIEAYSYYSAVFDSLDATLPRESQDRMNVERQCLARDI 537
                  ************************************************************************************************** PP

         GRAS 295 vnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaW 373
                  vn++aceg+er+er e ++kWr+r+ +aGFk+ p+s+++ ++++ l++++ ++ y+++e+ g+l +gW+d++L+++SaW
  Gh_A02G0703 538 VNIIACEGEERIERYEVAGKWRARMIMAGFKSCPMSSNVIDTIQKLIKEYCDR-YKLKEDVGALHFGWEDKSLIVASAW 615
                  ***************************************************66.************************* PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098565.59220597IPR005202Transcription factor GRAS
PfamPF035141.3E-133248615IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 616 aa     Download sequence    Send to blast
MSMATFLRSF WSENAETQTL SPYSQVIVQP DLTKKGSQLP GFRNLNTAEP FYNSTVQRVG  60
TMSLVRSAEP ATASCRNTKL YSIQDSSDST GMAIRMFGSD KHKSVYVMDS YSSESYEKYF  120
LDSPTDELIH SSSSGISGSS VRLQDVSSCQ IRDYSEIQSP DTLDSDSDKM KLKLQELERA  180
LLADNDVDGD DDMFGTGLSM EVDGEWSDPI RMGSHHDSPK ESSSSGSYLD CVSGDKEVSH  240
VSSQTPKQML IECAAILSEG HIEKASAIIN ELRQKVSIQG DPPQRIAAYM VEGLAARMAA  300
SGKYLYKALR CKEPPSSDRL AAMQILFEVC PCFKFGFMAA NGAIIEAFKG EKRVHIIDFD  360
ISQGSQYITL IQTIAKLPGK PPHLRLTGVD DPESVQRLNG GLEIVGLRLE KLAEILGVPF  420
EFRAVPSRTS LVAPSMLDCK PGEALIVNFA FQLHHMPDES VSTINQRDQL LRMVKSMNPK  480
LVTVVEQDVN TNTSPFFPRF IEAYSYYSAV FDSLDATLPR ESQDRMNVER QCLARDIVNI  540
IACEGEERIE RYEVAGKWRA RMIMAGFKSC PMSSNVIDTI QKLIKEYCDR YKLKEDVGAL  600
HFGWEDKSLI VASAWS
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A7e-6525261625379Protein SCARECROW
5b3h_A6e-6525261624378Protein SCARECROW
5b3h_D6e-6525261624378Protein SCARECROW
Search in ModeBase
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Ghi.93020.0boll| ovule| stem
Expression -- Description ? help Back to Top
Source Description
UniprotTISSUE SPECIFICITY: Expressed in seedlings, roots, shoots, leaves, flowers and siliques. {ECO:0000269|PubMed:10341448, ECO:0000269|PubMed:18500650}.
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016742756.10.0PREDICTED: scarecrow-like protein 1 isoform X1
RefseqXP_017633559.10.0PREDICTED: scarecrow-like protein 1 isoform X1
SwissprotQ9SDQ30.0SCL1_ARATH; Scarecrow-like protein 1
TrEMBLA0A2P5Y4F20.0A0A2P5Y4F2_GOSBA; Uncharacterized protein
STRINGGorai.005G083700.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM69532744
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21450.10.0SCARECROW-like 1
Publications ? help Back to Top
  1. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]