PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gh_D02G0132
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 1396aa    MW: 161077 Da    PI: 5.2121
Description GRAS family protein
Gene Model
Gene Model ID    Type      Source     Coding Sequence
Gh_D02G0132      genome    NAU-NBI    View CDS
Signature Domain
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1      GRAS      94.1     2.1e-29    15       380    3            374
         GRAS   3 elLlecAeavssgdlelaqalLarlselaspdg.dpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshltaN 99 
                  +lLl+cAea+++gdl+ a+a+L+++  la ++      R+++yf+ AL +r ++        l+p ++          ++++ ++ sP++  +  + N
  Gh_D02G0132  15 RLLLSCAEAIEDGDLKSADAYLQNILILADERPyLYKSRVVKYFADALVRRAYG--------LHPASS----------YFTFPVDPSPYYHCGSYLIN 94 
                  79**************************9998867899****************........222211..........22222344444444444444 PP

         GRAS 100 qaI........le..avegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfe..fnvlvakr 185
                    I        +e  a+ g++++H iDf+i +     +  ++L + +++p  +R+  + +p  ++  +  +  e L++ A+e++v++e   ++++ ++
  Gh_D02G0132  95 GVIenvihdalMEknALMGNRKLHLIDFSIPYSSFQNSVVRTLPTFSGDPLPVRVSYILPPFLKKYVKFLRQMEFLTRDAKEVNVKLEdeLKLVYGNS 192
                  4431111000032225667799**************************************887755555555578************7336778999* PP

         GRAS 186 ledleleeLrvkp...gEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklprese 280
                  l++++  e+++k+   +E+++V   ++l +l+++  ++e      L  +k+++P++v++ +  ++h++++Fl+ f ++++y  + +d++e+       
  Gh_D02G0132 193 LAEVDECEIDLKRrrdDEMVVVYYKFKLDKLVRDAKAMER----ELVRLKEINPTIVIMLDFYSNHTHSNFLTCFKDSFQYSLKTLDCWEEL------ 280
                  *********9999999****************99888888....8999*****************************************887...... PP

         GRAS 281 erikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvk..sdgyrve.........eesgslvlgWkdrpL 367
                  +  + E+   +   + + a eg++ + rh tl +W++ +++aGF+ +pl++++   ++l+++ v+  +d +++          +e+++l+lg k+ p+
  Gh_D02G0132 281 DLYFDEEYAWE---CHIEAWEGNNVIRRHPTLTEWQHLFSMAGFSRIPLNHRE--GIDLIVKDVNplNDFFSMSnqswleimgKEEECLILGYKECPM 373
                  33444444333...3455679999999**********************8764..455555555553333444433333366677************* PP

         GRAS 368 vsvSaWr 374
                  +++SaW+
  Gh_D02G0132 374 FFLSAWK 380
                  ******8 PP

Protein Features
3D Structure
Database           Entry ID    E-value     Start    End     InterPro ID    Description
PROSITE profile    PS50985     17.643      1        351     IPR005202      Transcription factor GRAS
Pfam               PF03514     7.3E-27     15       380     IPR005202      Transcription factor GRAS
PROSITE profile    PS50808     11.091      685      739     IPR003656      Zinc finger, BED-type
SMART              SM00614     3.8E-16     685      735     IPR003656      Zinc finger, BED-type
SuperFamily        SSF57667    1.9E-7      688      737     No hit         No description
Pfam               PF02892     4.1E-9      688      728     IPR003656      Zinc finger, BED-type
SuperFamily        SSF53098    1.37E-41    826      1270    IPR012337      Ribonuclease H-like domain
Pfam               PF14372     4.3E-15     1049     1137    IPR025525      hAT-like transposase, RNase-H fold
Pfam               PF05699     2.7E-17     1188     1269    IPR008906      HAT, C-terminal dimerisation domain
Gene Ontology
GO Term       GO Category           GO Description
GO:0003677    Molecular Function    DNA binding
GO:0046983    Molecular Function    protein dimerization activity
Sequence
Protein Sequence    Length: 1396 aa
MASSLFDIDT DTALRLLLSC AEAIEDGDLK SADAYLQNIL ILADERPYLY KSRVVKYFAD  60
ALVRRAYGLH PASSYFTFPV DPSPYYHCGS YLINGVIENV IHDALMEKNA LMGNRKLHLI  120
DFSIPYSSFQ NSVVRTLPTF SGDPLPVRVS YILPPFLKKY VKFLRQMEFL TRDAKEVNVK  180
LEDELKLVYG NSLAEVDECE IDLKRRRDDE MVVVYYKFKL DKLVRDAKAM ERELVRLKEI  240
NPTIVIMLDF YSNHTHSNFL TCFKDSFQYS LKTLDCWEEL DLYFDEEYAW ECHIEAWEGN  300
NVIRRHPTLT EWQHLFSMAG FSRIPLNHRE GIDLIVKDVN PLNDFFSMSN QSWLEIMGKE  360
EECLILGYKE CPMFFLSAWK PKVEEEHLNF NSSNGKFGQG FNPYPSPLRP LRPFPEGLTL  420
SRVAAVAEIY DILNHLYCEH KFSLALTWVS KVDNMNETMS DPNKKYTFSI QSNSCYSKDW  480
NSYKFMRSCE YKIEQTIIEK ALESKDGYHF EPSLTKSDID DYMYLQRAKH CNVDVVVAIC  540
LQNRYTSNDI YVVEFYWPTT ESEISKSFTP RIFNDLKHME KKFVTVKVQG TEQAISNIPT  600
SSYTARPLKI AEETEDVDAV ELNGVNVQRG VVPNFPSPIT IQSSSKVVAA PSNTLEGPHN  660
QIFPNGDPEI VKANKQEPSK ATQRELRSKV WDHFDRFEED EKQVAKCKHC PKVLTGSSKS  720
GTTHLNNHSK VCPGKKKQNQ ESQLILPVDT NEGSLRFDKK RSHMDLAKMM IKLQCPLDMA  780
EQETFKNFVK GLQPMFEFQS KDILSYIHRI YDEEKEKLQL YFDKLASKFN LTVSLLKNNS  840
GKTIYCCLIS HFIDDGWELK RKILALKTLE HINDTKALGE IIRSLVLEWN ISNKVCSITV  900
DNSFLNDSMV DQIKEICLSD QGSVSSDHWF ISFTLLEDGF REMDGILFKL RKSIEYVTET  960
RHGKLKFQEA VDQVKLQGGK LWDDLSFRLK SDFDILDSAL RSREIFCKLE QIDDNFKLNP  1020
TMEEWENAVA LQSCLKCFDD VKGTQCLPVS LYLPKLCDTY KKFLQLEKSS HSFVTLMKRK  1080
FDRYWSLCNL ALAVASVLDP RLKFKIVELS YRVIYGHDSK MRLNMFHKVL RDVYYEYASE  1140
AKNLTSSASV LDDFNCSTIV LGNDSILDSL SKFASASNFN EEASWKLELE LYLDEPLLPM  1200
DGAFFDILGW WCDKSQRFPI LAKMAQDFLA IPVSISTSCS NISAMINNPA YGSLNPESME  1260
ALVCSENWLE TPKEKSIGEK AEIMKALVCN DSRLESSIGK PDHEKNVDDV IEILNYDLLF  1320
DNNQSDEVQS SSSESEDETT LKEEGPWCEQ DIKAYLLSRF TSKEYKRLDK WRKDELNGYI  1380
ASFDVLIVEY EMIIIT
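
The listing above groups residues in blocks of ten and ends each full line with a cumulative position counter. As a minimal sketch (not part of the database record; the function name `parse_sequence_block` is hypothetical), the following Python shows one way to turn such a listing back into a plain sequence string, using the first two lines of this entry as a sample:

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a numbered, space-grouped sequence listing into one string."""
    residues = []
    for line in text.splitlines():
        # Drop the trailing cumulative position counter, if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces between the 10-residue groups.
        residues.append(re.sub(r"\s+", "", line))
    return "".join(residues)


# First two lines of the listing above, used as a small sample.
sample = """\
MASSLFDIDT DTALRLLLSC AEAIEDGDLK SADAYLQNIL ILADERPYLY KSRVVKYFAD  60
ALVRRAYGLH PASSYFTFPV DPSPYYHCGS YLINGVIENV IHDALMEKNA LMGNRKLHLI  120
"""

seq = parse_sequence_block(sample)
print(len(seq))    # 120
print(seq[14:20])  # RLLLSC (residues 15-20, 1-based)
```

Residues 15-20 recovered this way match the start of the GRAS domain alignment shown in the Signature Domain section, which begins at position 15 with `RLLLSC`.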
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      758      762    KKRSH
Expression -- Microarray
Source    ID          E-value
GEO       72278395    1e-124
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    JX583793    5e-42      JX583793.1 Gossypium hirsutum clone NBRI_GE16930 microsatellite sequence.
Annotation -- Protein
Source    Hit ID                E-value    Description
Refseq    XP_016728589.1        0.0        PREDICTED: uncharacterized protein LOC107939729 isoform X4
Refseq    XP_016728590.1        0.0        PREDICTED: uncharacterized protein LOC107939729 isoform X4
TrEMBL    A0A1U8MP62            0.0        A0A1U8MP62_GOSHI; uncharacterized protein LOC107939729 isoform X4
STRING    Gorai.005G031000.1    0.0        (Gossypium raimondii)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Malvids    OGEM8809420
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT1G14920.1    5e-30      GRAS family protein