PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.004G003401.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family MIKC_MADS
Protein Properties Length: 477aa    MW: 52794.9 Da    PI: 6.8533
Description MIKC_MADS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.004G003401.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1SRF-TF88.82.9e-284089251
                          ---SHHHHHHHHHHHHHHHHHHHHHHHHHHT-EEEEEEE-TTSEEEEEE- CS
                SRF-TF  2 rienksnrqvtfskRrngilKKAeELSvLCdaevaviifsstgklyeyss 51
                          rie+++ rqvtfskRr g+lKKA+ELSvLCdaeva+i+fs++g+ly ++s
  Sobic.004G003401.1.p 40 RIEDSTSRQVTFSKRRSGLLKKAYELSVLCDAEVALIVFSPRGRLYQFAS 89
                          8***********************************************86 PP

2K-box73.94.4e-251132021099
                 K-box  10 eeakaeslqqelakLkkeienLqreqRhllGedLesLslkeLqqLeqqLekslkkiRskKnellleqieelqkkekelqeenkaLrkkl 98 
                            e+  e+++ e++ L k+i+ +++ +R+llGe+L+s+s++eL++Le qLeksl+ iR++K++ l++qi el++ke++l  en +Lr + 
  Sobic.004G003401.1.p 113 VETGIEKWKYEATTLGKKIDAIETYKRKLLGENLGSCSVQELKELEAQLEKSLSIIRQRKERKLMDQILELREKEQKLLMENAMLRDQC 201
                           567789********************************************************************************987 PP

                 K-box  99 e 99 
                           +
  Sobic.004G003401.1.p 202 K 202
                           6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM004321.9E-363190IPR002100Transcription factor, MADS-box
PROSITE profilePS5006629.9693191IPR002100Transcription factor, MADS-box
SuperFamilySSF554552.09E-2933115IPR002100Transcription factor, MADS-box
PROSITE patternPS0035003387IPR002100Transcription factor, MADS-box
PRINTSPR004043.0E-283353IPR002100Transcription factor, MADS-box
PfamPF003191.0E-264087IPR002100Transcription factor, MADS-box
CDDcd002651.44E-3441108No hitNo description
PRINTSPR004043.0E-285368IPR002100Transcription factor, MADS-box
PRINTSPR004043.0E-286889IPR002100Transcription factor, MADS-box
PROSITE profilePS5129714.553117207IPR002487Transcription factor, K-box
PfamPF014863.2E-25118201IPR002487Transcription factor, K-box
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0046983Molecular Functionprotein dimerization activity
Sequence ? help Back to Top
Protein Sequence    Length: 477 aa     Download sequence    Send to blast
MVMATAAAAM DVDAPAPAPV PDGNAAANKQ GRRGRREMRR IEDSTSRQVT FSKRRSGLLK  60
KAYELSVLCD AEVALIVFSP RGRLYQFASA ADLQNTIDRY LKHTEGTLAN GKVETGIEKW  120
KYEATTLGKK IDAIETYKRK LLGENLGSCS VQELKELEAQ LEKSLSIIRQ RKERKLMDQI  180
LELREKEQKL LMENAMLRDQ CKALPLLELN DNKEHDHHMD GAGDGGEDDE AAAAKEDVET  240
ELAIGTVVMD KAIMSNNQAG KVLKKGKKKQ AKDELDRQKQ AEKKRRRLEK ALANSAAIIS  300
ELEKKKQKKK EEQQRLDEEG AAIAEAVALH VLIGEDSDEP CHLMLNKHIR CNHWDASAAF  360
EFTVDAQSTD IYPSDDGLIC ASHAYAPRPK GRWADWGIGQ PLPSWGEVSD LQGPYYQGTF  420
HQSVTCPGFI AAQAVSSLQI REESSEITSP SQGAAAATVV NRMLGGTNRL NLYREI*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1n6j_A1e-1633143293Myocyte-specific enhancer factor 2B
1n6j_B1e-1633143293Myocyte-specific enhancer factor 2B
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1266286KKKQAKDELDRQKQAEKKRRR
2283287KRRRL
3284307RRRLEKALANSAAIISELEKKKQK
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Sbi.198001e-148root
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.004G003401.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankBT0678540.0BT067854.1 Zea mays full-length cDNA clone ZM_BFc0171J16 mRNA, complete cds.
GenBankEU9564830.0EU956483.1 Zea mays clone 1564301 hypothetical protein mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002451346.11e-176MADS-box protein SOC1
TrEMBLA0A1Z5RKP10.0A0A1Z5RKP1_SORBI; Uncharacterized protein
STRINGSi019985m0.0(Setaria italica)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP38051616
Representative plantOGRP387233
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G45660.11e-47AGAMOUS-like 20