PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_16824_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 804aa    MW: 89658 Da    PI: 7.5804
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_16824_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS312.59.4e-96514512373
                        GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalkl. 83 
                                 ++lL++cA+a++s+d++laq++L+ l+++a pdgd++qRl+  f++AL  r a+  s+++k l++ ++++ n s  ++ +++ 
  Cotton_A_16824_BGI-A2_v1.0  51 EQLLVHCANAIESNDATLAQQILWVLNNIAPPDGDSNQRLTCAFLRALIVRAAK--SGTCKMLAAMANAHCNLSIDIHTFSVi 131
                                 79****************************************************..889999988888888866666666555 PP

                        GRAS  84 ....fsevsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg....skeel 158
                                     f++++P+++f++ +aN aIleaveg + +Hi+D+++++++Q p+L++a+asR egpp +++T+ g ++++       ++
  Cotton_A_16824_BGI-A2_v1.0 132 elasFVDLTPWHRFGFTAANAAILEAVEGYSVIHIVDLSLTHCMQIPTLIDAIASRLEGPPLVKLTVAGGATEDvppmLDLSY 214
                                 6688*****************************************************************99988998888899 PP

                        GRAS 159 eetgerLakfAeelgvpfefnvl...vakrledleleeLrvkp......gEalaVnlvlqlhrll................de 216
                                 ee+g++L +fA++ +v +ef+++   +a+ +++l +e+Lrv++      gEal++n++++lh+l+                 e
  Cotton_A_16824_BGI-A2_v1.0 215 EELGSKLINFARSRNVVLEFRAIpstYADGFSSL-IEQLRVQHlvyaesGEALVINCHMMLHYLPeetlsplsnvnsnpysFE 296
                                 ***********************99844455555.55555555555559*******************************999 PP

                        GRAS 217 svsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvva 299
                                 + s++s r+ +Lk++++l P+vvv+v+++ad++s++++ r+  a++y +  +d++++ lp+ s++r+++E+  ++++i+nv+a
  Cotton_A_16824_BGI-A2_v1.0 297 PSSIQSLRTMFLKALRGLDPTVVVLVDEDADLTSNNLVCRLRAAFNYLWIPYDTVDTFLPQGSKQRQWYEAD-ISWKIENVIA 378
                                 9999********************************************************************.********** PP

                        GRAS 300 cegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaW 373
                                 +eg +r+er e +++W +r++++GF+ v+++e+a +++k++l +++  g+ +++e++ lvl+Wk++++v+++aW
  Cotton_A_16824_BGI-A2_v1.0 379 HEGLQRVERLEPKSRWVQRMRNVGFRGVSFGEEAISEVKTMLDEHA-AGWGLKKEEDDLVLTWKGHNVVFATAW 451
                                 **********************************************.8999*********************** PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098544.90324433IPR005202Transcription factor GRAS
PfamPF035143.3E-9351451IPR005202Transcription factor GRAS
PfamPF127672.5E-51468731IPR024738Transcriptional coactivator Hfi1/Transcriptional adapter 1
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0070461Cellular ComponentSAGA-type complex
Sequence ? help Back to Top
Protein Sequence    Length: 804 aa     Download sequence    Send to blast
MMQFSEIPPL HQIPPFSIAT MNKNQIHRAR PWPGFPTSKG LGSFGDANCM EQLLVHCANA  60
IESNDATLAQ QILWVLNNIA PPDGDSNQRL TCAFLRALIV RAAKSGTCKM LAAMANAHCN  120
LSIDIHTFSV IELASFVDLT PWHRFGFTAA NAAILEAVEG YSVIHIVDLS LTHCMQIPTL  180
IDAIASRLEG PPLVKLTVAG GATEDVPPML DLSYEELGSK LINFARSRNV VLEFRAIPST  240
YADGFSSLIE QLRVQHLVYA ESGEALVINC HMMLHYLPEE TLSPLSNVNS NPYSFEPSSI  300
QSLRTMFLKA LRGLDPTVVV LVDEDADLTS NNLVCRLRAA FNYLWIPYDT VDTFLPQGSK  360
QRQWYEADIS WKIENVIAHE GLQRVERLEP KSRWVQRMRN VGFRGVSFGE EAISEVKTML  420
DEHAAGWGLK KEEDDLVLTW KGHNVVFATA WLMNLETEMP GRRHFSLVDT LELKSQIERK  480
IGPIKTEKYF NLLTRFLSLK IGKPEFDRLC IGIIGRENVR LHNHLLRSII RNASLSKNHP  540
STGNKLESAL SVKAANGYQR SNLKSMCKDF PQSPRKGRTT NLRDHNHPSP LGPHGKSRGT  600
VCEDAVPRVQ EQQSATELLS LGSRPPMSLE EGEEVDQVAG SPSIRSRSPV RAPLGISLDA  660
KGMRKVPWNG LASTSETCHC KGELPDTGSL RKRLEKKLEM EGLNMSVDCP NLLNSSLDVF  720
MKRLIKPCLE LAGSRSRQKL IDQGHNWSTV SLNGMWPLRY AQKQNGSISV SMLDFRVAME  780
INSPLLGVDW PTKLEKVCLH ASEE
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_B8e-434545182472Protein SHORT-ROOT
5b3h_B3e-434545128418Protein SHORT-ROOT
5b3h_E3e-434545128418Protein SHORT-ROOT
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankGQ3935381e-129GQ393538.1 Gossypium hirsutum cultivar Deltapine 33 B clone MONCS0503 SSR marker CGR5533 genomic sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017625839.10.0PREDICTED: scarecrow-like protein 32
SwissprotQ9SN221e-179SCL32_ARATH; Scarecrow-like protein 32
TrEMBLA0A1U8IWR50.0A0A1U8IWR5_GOSHI; scarecrow-like protein 32
STRINGEOX916400.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM54452750
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G49950.10.0GRAS family protein