PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_05556_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 634aa    MW: 72764.6 Da    PI: 6.7537
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_05556_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS385.18.4e-1182606321374
                        GRAS   1 lvelLlecAeavssgdle.laqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalk 82 
                                 l++lL++cA+av++g+ +  +++lL+++++++sp gd +qRla+y++eAL+arla+++s++yk+l +++ts     + ++a+ 
  Cotton_A_05556_BGI-A2_v1.0 260 LRTLLITCAQAVAAGERNgTTSELLKQIRQHSSPFGDGNQRLAHYLAEALEARLAGTGSHIYKSLVSKRTS---AYDIMKAYL 339
                                 6789**********99987899********************************************99999...899****** PP

                        GRAS  83 lfsevsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetge 163
                                 l++ ++P+ k+sh+++N+ I  a  ++ r+H+iDf+i +G+QWp+L++ L+ R+egpp+lRiTg++ p++g   +e++eetg+
  Cotton_A_05556_BGI-A2_v1.0 340 LYVAACPFRKVSHFICNKSINVASRKSMRLHVIDFGILYGFQWPTLIERLSLRREGPPKLRITGIDFPQPGfkPAERVEETGR 422
                                 **********************************************************************999********** PP

                        GRAS 164 rLakfAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqea 246
                                 rLa +Aee++vpf++++ +ak++e++++eeL+++++E ++Vn+ ++ ++llde+v+++s+r+ vL+l+++++P+++++   + 
  Cotton_A_05556_BGI-A2_v1.0 423 RLAAYAEEFKVPFQYKA-IAKKWETVRVEELEIEEDEFVVVNCLYRAKNLLDETVAVHSPRNLVLNLIRKINPNLFIHGIING 504
                                 *****************.7**************************************************************** PP

                        GRAS 247 dhnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvpl 329
                                  +n++ F++rf eal ++s++fd+l+a +pre+ er+ +Ere+lgre+ n++ace  er+er et+++W++r+ +aGF + p+
  Cotton_A_05556_BGI-A2_v1.0 505 AYNAPFFVTRFREALFHFSSMFDMLDAIVPREDWERMLIEREILGREALNAIACESWERVERPETVKQWHARILRAGFLQQPF 587
                                 *********************************************************************************** PP

                        GRAS 330 sekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                                 + +++k+a   +++++ + + ++e++ +lv+gWk+r ++++SaW+
  Cotton_A_05556_BGI-A2_v1.0 588 EREIVKEAFERVQTFYHKDFVIDEDNRWLVQGWKGRIIYALSAWK 632
                                 ****************888*************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098564.066234612IPR005202Transcription factor GRAS
PfamPF035142.9E-115260632IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 634 aa     Download sequence    Send to blast
MDPRFKGLSV FQKQNFKEFH YHPSEQPASS LRHEEDLSED CDFSDSILRY INDILMEEDM  60
EDKSCMLQES LDLQAAEKSF YDVLGKKYPP SPEHNSSSFG DINGDMLQNQ TQSLNVSSIS  120
QSSYSSSSMV SLDGMLESPN STLQVPESIW QFNKGVEEAS KFIPSNVDLF GNFESYSKGR  180
KFSNRDDVTD EDERSSKQVA VCSETSVRSE MLDMVLLCSS GKPPTRFTAL RESLRDGISR  240
KVQQKGRGKK QSGKKEVVDL RTLLITCAQA VAAGERNGTT SELLKQIRQH SSPFGDGNQR  300
LAHYLAEALE ARLAGTGSHI YKSLVSKRTS AYDIMKAYLL YVAACPFRKV SHFICNKSIN  360
VASRKSMRLH VIDFGILYGF QWPTLIERLS LRREGPPKLR ITGIDFPQPG FKPAERVEET  420
GRRLAAYAEE FKVPFQYKAI AKKWETVRVE ELEIEEDEFV VVNCLYRAKN LLDETVAVHS  480
PRNLVLNLIR KINPNLFIHG IINGAYNAPF FVTRFREALF HFSSMFDMLD AIVPREDWER  540
MLIEREILGR EALNAIACES WERVERPETV KQWHARILRA GFLQQPFERE IVKEAFERVQ  600
TFYHKDFVID EDNRWLVQGW KGRIIYALSA WKPE
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A5e-452496338380Protein SCARECROW
5b3h_A4e-452496337379Protein SCARECROW
5b3h_D4e-452496337379Protein SCARECROW
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJX5778781e-166JX577878.1 Gossypium hirsutum clone NBRI_GE9783 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017627452.10.0PREDICTED: scarecrow-like protein 9
SwissprotO809330.0SCL9_ARATH; Scarecrow-like protein 9
TrEMBLA0A2P5XEV70.0A0A2P5XEV7_GOSBA; Uncharacterized protein
STRINGGorai.007G376400.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM35827189
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G37650.10.0GRAS family protein