PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_04412_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 1070aa    MW: 123007 Da    PI: 6.7395
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_04412_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS165.73.8e-51153703374
                        GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfs 85 
                                 e L++cA+a+++g+l+ a+++L ++ ++a+ + d + +l+ yf+eAL +r ++         pp  t+++ ++ +  ++ +++
  Cotton_A_04412_BGI-A2_v1.0  15 EILVSCAHAIEDGNLKIADSFLHQIWNTAAVELDLISKLVRYFAEALVRRAYGLH-------PPYYTHSNLQIPHPLYYYYYY 90 
                                 689**************************************************32.......444444444555555555555 PP

                        GRAS  86 evsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakf 168
                                   + i    + +  +aI  a+ g++  H iDf i +      L+++L +R+++p s+RiT+v +   +++   +e  e L++ 
  Cotton_A_04412_BGI-A2_v1.0  91 SRFDI----NEMVGEAIESATTGKKGFHLIDFHIPHLYGRGYLFKTLPNRSSDPLSVRITVVLPTFLKNTVDFQEEMEYLTEA 169
                                 54443....566789********************9999999**********************7777999************ PP

                        GRAS 169 Aeelgvpfefnvl...vakrledleleeLrvkp...gEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqe 245
                                  + l+++++ + l   +a++l +++ ++L++++   +Eal+V   ++ h+ll+e  ++++     L  +++l+P++v++ eq 
  Cotton_A_04412_BGI-A2_v1.0 170 GKLLKIELKKEDLrvvYANSLGEVDESTLDLRRtndDEALVVYYNFKFHTLLAEAEAMKK----ELIKLRQLNPEIVIMQEQY 248
                                 ********86555444999**********9999999****************99999999....78889************** PP

                        GRAS 246 adhnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvp 328
                                 a+ n+++F +r+  +++yys +f  +++ + + +    +  +    r+i n+vaceg++r++rh++l +Wr+ l +aGF ++p
  Cotton_A_04412_BGI-A2_v1.0 249 ANDNDGNFIKRLEYSFRYYSNFFQYYSNLFKSGKPLGYNTAKY-YMRQIHNIVACEGRDRIMRHQSLDEWRDLLLTAGFLQIP 330
                                 ***************************9998888888888777.779************************************ PP

                        GRAS 329 lsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                                 +++++    + l + + ++  +++ee+g+lvl  kd p+++vS+Wr
  Cotton_A_04412_BGI-A2_v1.0 331 FQKDV----ENLHALYWVE--EIKEEKGCLVLSHKDCPILFVSCWR 370
                                 98765....5666666666..89**********************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098529.3211356IPR005202Transcription factor GRAS
PfamPF035141.3E-4815370IPR005202Transcription factor GRAS
PROSITE profilePS5060011.637702909IPR003653Ulp1 protease family, C-terminal catalytic domain
SuperFamilySSF540013.92E-26779937No hitNo description
Gene3DG3DSA:3.30.310.1301.3E-9788904No hitNo description
PfamPF029028.4E-16812935IPR003653Ulp1 protease family, C-terminal catalytic domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006508Biological Processproteolysis
GO:0008234Molecular Functioncysteine-type peptidase activity
Sequence ? help Back to Top
Protein Sequence    Length: 1070 aa     Download sequence    Send to blast
MASSSSFSSA DAALEILVSC AHAIEDGNLK IADSFLHQIW NTAAVELDLI SKLVRYFAEA  60
LVRRAYGLHP PYYTHSNLQI PHPLYYYYYY SRFDINEMVG EAIESATTGK KGFHLIDFHI  120
PHLYGRGYLF KTLPNRSSDP LSVRITVVLP TFLKNTVDFQ EEMEYLTEAG KLLKIELKKE  180
DLRVVYANSL GEVDESTLDL RRTNDDEALV VYYNFKFHTL LAEAEAMKKE LIKLRQLNPE  240
IVIMQEQYAN DNDGNFIKRL EYSFRYYSNF FQYYSNLFKS GKPLGYNTAK YYMRQIHNIV  300
ACEGRDRIMR HQSLDEWRDL LLTAGFLQIP FQKDVENLHA LYWVEEIKEE KGCLVLSHKD  360
CPILFVSCWR PRAGEEHFKF NLNSNNLGQE VCFRYELPVA LTWACEATTD KIMLDGKKHT  420
LFMERTSCYA SNEGSQCFME ACAKHHIQEG QAIAGKAFQS SANFHFKPSI TKLMKSDYPL  480
FNAAQLFGSH AVVAICLQNH YIIGDVYVVE FYWPEIESEK SESLALDIFS DLKNMKKKFV  540
TIRVGGNEVG FEREAISTTL QGTMHMRNAQ PASSTNDLLS SNTTWSLNAV QQCDVHEMER  600
HGLVEQVESA PFSTPNPMSY GGVLQTQGPH KQYKRKRKME DHSNVVKVSK NWNREEANSS  660
GDIAKGSIKN EPNHGRNITA LIEIPKDDSP FSNNKSGQFQ SLSSESDNET TLKEQGSWCK  720
EDVRAYLVSR FTGKENKRLN RWQTNELIGK LIGRDKEFCL MGDKLAPLLM VPHGDETRKE  780
YYIDDCVVNT FFKLLKKRSD RFPKAYINHY SFDSQIALFL PLCLSAHWVL FYVNTKEKKI  840
SWLDSNPSSR IMSNNVEKQT ILQWFTTFLL PEFGYYDANE WPFLVRTDIP VQKNWVDCGV  900
FVMKYGDCLT HGDFFPFTQN DMVHFRRRIF LDIYRGRLYV YCSLISKDIE LEHGYPSEAL  960
DLCNSLLRTD LKPNEAIITS ILSACADLGS LSIRNEIELY VINSHSRLVE DGLKYFKSMK  1020
DNFGIKPGIE HYTCLVDLLG RVGHFNLALK IENHPRDAYA STSSSLGSIA
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
6nnq_A2e-1279193878223Sentrin-specific protease 1
Search in ModeBase
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016714041.10.0PREDICTED: uncharacterized protein LOC107927484 isoform X3
TrEMBLA0A1U8LHF60.0A0A1U8LHF6_GOSHI; uncharacterized protein LOC107927484 isoform X3
STRINGGorai.005G014600.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM8809420
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G01570.12e-39GRAS family protein