PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_30653_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 627aa    MW: 71211 Da    PI: 6.6731
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_30653_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS395.45.8e-1212546233374
                        GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfs 85 
                                 +lL++cA+av+ +d++ a++lL+++s+++s +gd +qRla+yf+ AL++rla+++   y+ l +++ts    ++ l+a+++++
  Cotton_A_30653_BGI-A2_v1.0 254 SLLTQCAQAVTINDQRTANELLKQISQNSSTTGDGTQRLAHYFADALKTRLAGMGAPSYSPLVSNRTS---AADILKAYRVLV 333
                                 79*********************************************************999988888...9*********** PP

                        GRAS  86 evsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLa 166
                                  ++P+ k+ h+ aN+ I++ +e+++++Hi+Df+i +G+QWp+L+q L++R++gpp+lRiTg++ p++g   +e++eetg+rL+
  Cotton_A_30653_BGI-A2_v1.0 334 LACPFKKMMHFYANKKIMKVAEKATTLHIVDFGICYGFQWPCLIQRLSARAGGPPKLRITGIEFPQPGfrPAERVEETGRRLK 416
                                 *******************************************************************9*************** PP

                        GRAS 167 kfAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhn 249
                                 +++e+++vpfe+nv +ak++e+++leeL++k++E+++Vn++++l++l+d+++s++s+rd vLkl++s++P+ +++  ++  +n
  Cotton_A_30653_BGI-A2_v1.0 417 RYCEKFKVPFEYNV-IAKKWETIQLEELKIKKDEVVVVNCMYRLKNLPDDTLSSTSARDIVLKLIRSINPEFFIHGISNGTYN 498
                                 *************9.7******************************************************************* PP

                        GRAS 250 sesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsek 332
                                 ++ F++rf eal ++sa+fd +ea++pr++ +r++ Ere++gr+i+nvvaceg er+er et+++W++r  +aGFk++pl+++
  Cotton_A_30653_BGI-A2_v1.0 499 APFFVTRFREALFHFSAMFDIFEANVPRDDPQRMMFEREVIGRDIMNVVACEGIERVERPETYKQWQARTLRAGFKQIPLDQD 581
                                 *********************************************************************************** PP

                        GRAS 333 aakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                                  +k++  +++  + + + ++ +  ++++gWk+r ++++S+W+
  Cotton_A_30653_BGI-A2_v1.0 582 LVKKVTNMVQSNYHRDFIIDVDGRWMLQGWKGRVIFALSCWK 623
                                 *************999*************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098566.724226603IPR005202Transcription factor GRAS
PfamPF035142.0E-118254623IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 627 aa     Download sequence    Send to blast
MYAFKFDHGA VPVYSNHDFV NNNNGFIKDS PPQVVPSPDS PVESWTTTNS EAHPLDNIPF  60
ANEMLKYINE MLMEEDMEEK TCMLQDCLAL QAAEKSFYEV LGHEYPLSTD PIPAYTDQTG  120
GNPGDFDSSL IQTSLVDSLE RTSLFPDLQR GIPPSIEPSG SSLLGSKGRK NYERGDVDDL  180
EQGRSNKQSA VSLEDSEQTD MFDDILLCKG ENEDDPRCSL NESSQRLWPQ KGTSKGGTAR  240
RKNGKKSEVV DLWSLLTQCA QAVTINDQRT ANELLKQISQ NSSTTGDGTQ RLAHYFADAL  300
KTRLAGMGAP SYSPLVSNRT SAADILKAYR VLVLACPFKK MMHFYANKKI MKVAEKATTL  360
HIVDFGICYG FQWPCLIQRL SARAGGPPKL RITGIEFPQP GFRPAERVEE TGRRLKRYCE  420
KFKVPFEYNV IAKKWETIQL EELKIKKDEV VVVNCMYRLK NLPDDTLSST SARDIVLKLI  480
RSINPEFFIH GISNGTYNAP FFVTRFREAL FHFSAMFDIF EANVPRDDPQ RMMFEREVIG  540
RDIMNVVACE GIERVERPET YKQWQARTLR AGFKQIPLDQ DLVKKVTNMV QSNYHRDFII  600
DVDGRWMLQG WKGRVIFALS CWKPVKN
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5hyz_A2e-532566238375GRAS family transcription factor containing protein, expressed
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017641909.10.0PREDICTED: scarecrow-like protein 30
SwissprotP0C8830.0SCL33_ARATH; Scarecrow-like protein 33
TrEMBLA0A1U8HGK20.0A0A1U8HGK2_GOSHI; scarecrow-like protein 30
STRINGGorai.003G130400.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM36462561
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G59450.10.0GRAS family protein