PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_08801_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 458 aa    MW: 52430.2 Da    pI: 7.8175
Description GRAS family protein
Gene Model
Gene Model ID               Type    Source
Cotton_A_08801_BGI-A2_v1.0  genome  BGI
Signature Domain
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GRAS    283.4  6.6e-87  84     456  1          374
                        GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalkl 83 
                                 l++ Ll +A+ v++ +++ a + L +l  ++s  gd++qR++a+f+ +L ar++   s  y+ +  ++++    ++++ a++ 
  Cotton_A_08801_BGI-A2_v1.0  84 LIHSLLIIATSVDKYNMNSALENLIQLYPTVSLMGDSVQRVVAHFADGLFARILTPKSPFYDMVMKEPST----EQQFLAFTS 162
                                 6899***************************************************999999988777776....677788899 PP

                        GRAS  84 fsevsPilkfshltaNqaIleavege......ervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeelee 160
                                 ++ vsP+++f+h+taNq I ea+e++      +++H+iDf++s+G+QWp+L+q+L++ +++  slR+Tg g+    s eel+e
  Cotton_A_08801_BGI-A2_v1.0 163 LYRVSPYYQFAHFTANQVIIEAFEKDqetnnnRALHVIDFNVSYGFQWPSLIQSLSQ-SGKRVSLRLTGYGR----SLEELQE 240
                                 ***********************998667776788********************98.78899*********....9****** PP

                        GRAS 161 tgerLakfAee.lgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvv 242
                                 t+ rL +fA+  +++ fef+ l  ++ ++l +++ r k++E++aVnlv++l +++    ++    +++Lk v sl+P +v++v
  Cotton_A_08801_BGI-A2_v1.0 241 TEARLVSFAKGfCNLVFEFQGL-LRSSSKLIINQ-REKKNETVAVNLVFHLSNFM----EM----SKTLKSVHSLKPSIVILV 313
                                 *********97369*******6.66666664555.7899***************9....44....459*************** PP

                        GRAS 243 eqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFk 325
                                 eqe +    +Fl rf+e+l+y++a+fdsl+  lp+es er ++Er  lg+ei+ ++  e+ +    +e++++W++++e+ GF+
  Cotton_A_08801_BGI-A2_v1.0 314 EQEGNPRVRNFLSRFMESLHYFAAMFDSLDDCLPQESTERLSIERNHLGKEIKAMINSENLD----EENKGTWKNMIERHGFG 392
                                 *************************************************************9....8999************* PP

                        GRAS 326 pvplsekaakqaklllrkvks.......dg......yrve.eesgslv.lgWkdrpLvsvSaWr 374
                                  ++ls+k   qaklll+           +g      +rv   ++g ++ lgW+dr L+++SaW+
  Cotton_A_08801_BGI-A2_v1.0 393 GMKLSSKCLIQAKLLLKVRTHnycplpcEGeninggFRVFqRDEGKALsLGWQDRCLLTASAWQ 456
                                 ****************865530111111221111125554333555555**************6 PP

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
PROSITE profile  PS50985   46.627   58     421  IPR005202    Transcription factor GRAS
Pfam             PF03514   2.3E-84  84     456  IPR005202    Transcription factor GRAS
Sequence
Protein Sequence    Length: 458 aa
MVMEDRDEEE EHLNLSLAIV TDPNGENSRK RKRKLLNVCN PLNPSLEGYF EGKIFKLLQV  60
REEMLKLDHK RSKGLAENGK GLHLIHSLLI IATSVDKYNM NSALENLIQL YPTVSLMGDS  120
VQRVVAHFAD GLFARILTPK SPFYDMVMKE PSTEQQFLAF TSLYRVSPYY QFAHFTANQV  180
IIEAFEKDQE TNNNRALHVI DFNVSYGFQW PSLIQSLSQS GKRVSLRLTG YGRSLEELQE  240
TEARLVSFAK GFCNLVFEFQ GLLRSSSKLI INQREKKNET VAVNLVFHLS NFMEMSKTLK  300
SVHSLKPSIV ILVEQEGNPR VRNFLSRFME SLHYFAAMFD SLDDCLPQES TERLSIERNH  360
LGKEIKAMIN SENLDEENKG TWKNMIERHG FGGMKLSSKC LIQAKLLLKV RTHNYCPLPC  420
EGENINGGFR VFQRDEGKAL SLGWQDRCLL TASAWQCV
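
The numbered sequence block above can be cross-checked against the coordinates reported elsewhere in this record. The following is a minimal sketch, not part of the PlantTFDB record itself: the helper names `clean_sequence` and `find_motif` are hypothetical, and the sequence text is simply copied from the block above. It strips the spacing and position counters, confirms the stated length, and locates the NLS motif and the start of the GRAS domain using 1-based positions.

```python
import re

# Sequence block copied verbatim from the record (ten-residue groups,
# with a running position counter at the end of each full line).
RAW = """
MVMEDRDEEE EHLNLSLAIV TDPNGENSRK RKRKLLNVCN PLNPSLEGYF EGKIFKLLQV  60
REEMLKLDHK RSKGLAENGK GLHLIHSLLI IATSVDKYNM NSALENLIQL YPTVSLMGDS  120
VQRVVAHFAD GLFARILTPK SPFYDMVMKE PSTEQQFLAF TSLYRVSPYY QFAHFTANQV  180
IIEAFEKDQE TNNNRALHVI DFNVSYGFQW PSLIQSLSQS GKRVSLRLTG YGRSLEELQE  240
TEARLVSFAK GFCNLVFEFQ GLLRSSSKLI INQREKKNET VAVNLVFHLS NFMEMSKTLK  300
SVHSLKPSIV ILVEQEGNPR VRNFLSRFME SLHYFAAMFD SLDDCLPQES TERLSIERNH  360
LGKEIKAMIN SENLDEENKG TWKNMIERHG FGGMKLSSKC LIQAKLLLKV RTHNYCPLPC  420
EGENINGGFR VFQRDEGKAL SLGWQDRCLL TASAWQCV
"""


def clean_sequence(raw: str) -> str:
    """Keep only residue letters, dropping whitespace and position counters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def find_motif(seq: str, motif: str) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) for every occurrence of motif.

    A zero-width lookahead is used so overlapping occurrences are also found.
    """
    pattern = re.compile(f"(?={re.escape(motif)})")
    return [(m.start() + 1, m.start() + len(motif)) for m in pattern.finditer(seq)]


seq = clean_sequence(RAW)

print("length:", len(seq))                       # compare with "Length: 458 aa"
print("RKRKRK at:", find_motif(seq, "RKRKRK"))   # compare with the NLS table
print("residues 84-90:", seq[83:90])             # start of the GRAS domain hit
```

On the sequence as printed here, the search places `RKRKRK` at residues 29 to 34, one position downstream of the 28 to 33 listed in the NLS table. That may reflect a different coordinate convention in the source annotation, so treat the table and the string search as two views to reconcile rather than assuming either is wrong.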
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  1e-49    92           455        27         378      Protein SCARECROW
5b3h_A  1e-49    92           455        26         377      Protein SCARECROW
5b3h_D  1e-49    92           455        26         377      Protein SCARECROW
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    28     33   RKRKRK
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_017633320.1      0.0      PREDICTED: scarecrow-like protein 23
TrEMBL  A0A1U8NZ71          0.0      A0A1U8NZ71_GOSHI; scarecrow-like protein 23
TrEMBL  A0A2P5XYH8          0.0      A0A2P5XYH8_GOSBA; Uncharacterized protein
STRING  Gorai.005G075100.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM15761916
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G03450.1  1e-51    RGA-like 2