PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sobic.003G393600.3.p
Common Name Sb03g043110, SORBIDRAFT_03g043110
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family B3
Protein Properties Length: 688aa    MW: 75571 Da    PI: 5.0937
Description B3 family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
Sobic.003G393600.3.p  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    B3      41.5   2.4e-13  91     178  1          97
                           EEEE-..-HHHHTT-EE--HHH.HTT.---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSS CS
                    B3   1 ffkvltpsdvlksgrlvlpkkfaeeh.ggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrs 88 
                           ffkv+    ++++g + +p +fa+++ ++   +   + led  g +W+v+l  + ++g   ++ GWk+Fv ++ +  g+f+vF+ + rs
  Sobic.003G393600.3.p  91 FFKVM--MGYFSEG-MDIPSPFARTIwDLA--G-SNIFLEDAFGLRWRVRL--CLRDGVLSFGHGWKNFVLDHAVSCGEFLVFRQIARS 171
                           77777..5566666.***********8444..2.26***************..88999999***********************98888 PP

                           EE..EEEEE CS
                    B3  89 efelvvkvf 97 
                            f  +v++f
  Sobic.003G393600.3.p 172 VF--TVQMF 178
                           88..77766 PP

2    B3      32.1   2e-10    617    682  30         97
                           ..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE CS
                    B3  30 keesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvf 97 
                           k++ k ++l+d+  r W v +  + +++   + +GW + +++n+L++gD++ F+l+g+se ++ ++v+
  Sobic.003G393600.3.p 617 KHGRKVVILKDPCMRLWPVLY--QCTPRFNGFITGWVDICRENRLQQGDTCEFELSGNSELSFQLQVR 682
                           3445689**************..66666655667**********************998887777665 PP

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
SuperFamily      SSF101936          2.55E-20  88     181  IPR015300    DNA-binding pseudobarrel domain
Gene3D           G3DSA:2.40.330.10  3.3E-18   88     181  IPR015300    DNA-binding pseudobarrel domain
PROSITE profile  PS50863            12.153    88     181  IPR003340    B3 DNA binding domain
CDD              cd10017            3.04E-16  89     178  No hit       No description
SMART            SM01019            3.8E-11   91     181  IPR003340    B3 DNA binding domain
Pfam             PF02362            2.0E-10   91     178  IPR003340    B3 DNA binding domain
SMART            SM01019            0.024     591    685  IPR003340    B3 DNA binding domain
CDD              cd10017            3.60E-11  607    683  No hit       No description
SuperFamily      SSF101936          4.32E-13  609    678  IPR015300    DNA-binding pseudobarrel domain
Gene3D           G3DSA:2.40.330.10  1.9E-11   617    679  IPR015300    DNA-binding pseudobarrel domain
Pfam             PF02362            1.5E-8    617    681  IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            9.53      623    683  IPR003340    B3 DNA binding domain
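The twelve hits in the table above cluster into two stretches of the protein. As a rough illustration, the Python sketch below merges overlapping Start/End intervals into consolidated regions. The coordinates are transcribed from the table as read here, and the merging rule (any overlap joins two hits) is an assumption chosen for the example, not something PlantTFDB or InterPro prescribes.

```python
from typing import List, Tuple

Interval = Tuple[int, int]

# (start, end) pairs transcribed from the Protein Features table.
HITS: List[Interval] = [
    (88, 181), (88, 181), (88, 181), (89, 178), (91, 181), (91, 178),
    (591, 685), (607, 683), (609, 678), (617, 679), (617, 681), (623, 683),
]


def merge_intervals(hits: List[Interval]) -> List[Interval]:
    """Merge overlapping inclusive intervals into the regions they cover."""
    merged: List[Interval] = []
    for start, end in sorted(hits):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


REGIONS = merge_intervals(HITS)
print(REGIONS)
```

Under that rule the hits collapse to two regions, 88-181 and 591-685, which bracket the two B3 matches listed under Signature Domain.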
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 688 aa
MQPAMDAEVV GVAVKQEPEE IVAEADGDEE GEPEKVVVKR RRKKKTACDP HKKRACVDCT  60
KRCARIHGRP ASPSALPSSS NARPVPAVPS FFKVMMGYFS EGMDIPSPFA RTIWDLAGSN  120
IFLEDAFGLR WRVRLCLRDG VLSFGHGWKN FVLDHAVSCG EFLVFRQIAR SVFTVQMFAP  180
SAVERLYHCE KNKRQSRKRK PRQKTCSPSI QTVKVTKNSV KNSKKRLRTD DQQNGIRPRC  240
RKSKMAAEVC IDESDVPDSA SEPKCSDTSE RVPEAGAAEP QEISEAPAGH ECEVQGVLDG  300
EAKIADDSTI LGEDQSNHNA ISASIMQVST ANEIEPGEGL NLPTDFDASV PLAMMDLNEV  360
SIDDIFLSAD IYEFESDMCN PESFSVDLNM VEPITTGQTS GFSCLEDTPQ NHLSSMGDGH  420
RSLIPEAVLC IENKEMTDVL GTGTSYFVDS SVHDIDINAL PANEPPSFGE DNPSPQADAE  480
MHSNECGLSS CNKDKGNSLL PLMNKQTAHK EYSSMTTEQD KAQGGQCNMQ DSIRQHATEI  540
MSRSAKPHEL ADLRQNHLQT VQPAGNGSES IHSGASESGG VLALTTNSIE FCIDVPAPGQ  600
TWLELPGRLP VLPRTKKHGR KVVILKDPCM RLWPVLYQCT PRFNGFITGW VDICRENRLQ  660
QGDTCEFELS GNSELSFQLQ VRVPNTQ*
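
The grouped layout above (ten-residue blocks, a running position counter, and a trailing `*`) needs cleaning before the sequence can be sliced. The Python sketch below is one way to do that and to pull out the two B3 regions using the Start/End values from the Signature Domain table, on the assumption (consistent with the alignments shown there) that those coordinates are 1-based and inclusive. The function names are illustrative and not part of any PlantTFDB tooling.

```python
import re

# Sequence block copied from this record, position counters included.
RAW = """
MQPAMDAEVV GVAVKQEPEE IVAEADGDEE GEPEKVVVKR RRKKKTACDP HKKRACVDCT  60
KRCARIHGRP ASPSALPSSS NARPVPAVPS FFKVMMGYFS EGMDIPSPFA RTIWDLAGSN  120
IFLEDAFGLR WRVRLCLRDG VLSFGHGWKN FVLDHAVSCG EFLVFRQIAR SVFTVQMFAP  180
SAVERLYHCE KNKRQSRKRK PRQKTCSPSI QTVKVTKNSV KNSKKRLRTD DQQNGIRPRC  240
RKSKMAAEVC IDESDVPDSA SEPKCSDTSE RVPEAGAAEP QEISEAPAGH ECEVQGVLDG  300
EAKIADDSTI LGEDQSNHNA ISASIMQVST ANEIEPGEGL NLPTDFDASV PLAMMDLNEV  360
SIDDIFLSAD IYEFESDMCN PESFSVDLNM VEPITTGQTS GFSCLEDTPQ NHLSSMGDGH  420
RSLIPEAVLC IENKEMTDVL GTGTSYFVDS SVHDIDINAL PANEPPSFGE DNPSPQADAE  480
MHSNECGLSS CNKDKGNSLL PLMNKQTAHK EYSSMTTEQD KAQGGQCNMQ DSIRQHATEI  540
MSRSAKPHEL ADLRQNHLQT VQPAGNGSES IHSGASESGG VLALTTNSIE FCIDVPAPGQ  600
TWLELPGRLP VLPRTKKHGR KVVILKDPCM RLWPVLYQCT PRFNGFITGW VDICRENRLQ  660
QGDTCEFELS GNSELSFQLQ VRVPNTQ*
"""


def clean_sequence(raw: str) -> str:
    """Keep only letters: drops spaces, newlines, counters, and the '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    return seq[start - 1:end]


SEQ = clean_sequence(RAW)
B3_1 = region(SEQ, 91, 178)    # first B3 hit in the Signature Domain table
B3_2 = region(SEQ, 617, 682)   # second B3 hit

print(len(SEQ))
print(B3_1[:10], B3_1[-5:])
print(B3_2[:10], B3_2[-5:])
```

The cleaned string holds 687 residues, one fewer than the stated length of 688 aa. The stated figure may count the terminal `*`, though the record does not say so.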
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    38     44   KRRRKKK
2    38     53   KRRRKKKTACDPHKKR
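
Checked against the protein sequence in this record, the Start/End values in the NLS table line up only if read as 0-based inclusive offsets, unlike the 1-based coordinates in the Signature Domain table. The Python sketch below illustrates that reading using the first 60 residues of the sequence. The helper name is made up for the example, and the 0-based interpretation is an inference from this record, not documented behavior.

```python
# First 60 residues of the protein sequence in this record (spaces removed).
N_TERM = "MQPAMDAEVVGVAVKQEPEEIVAEADGDEEGEPEKVVVKRRRKKKTACDPHKKRACVDCT"

# (start, end, sequence) rows from the NLS table.
NLS_ROWS = [
    (38, 44, "KRRRKKK"),
    (38, 53, "KRRRKKKTACDPHKKR"),
]


def slice_zero_based(seq: str, start: int, end: int) -> str:
    """Return seq[start..end], with both offsets 0-based and inclusive."""
    return seq[start:end + 1]


for start, end, expected in NLS_ROWS:
    found = slice_zero_based(N_TERM, start, end)
    print(start, end, found, found == expected)
```

Reading the same numbers as 1-based positions would shift each window one residue toward the N-terminus and no longer reproduce the listed sequences.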
Cis-element
Source       Link
PlantRegMap  Sobic.003G393600.3.p
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_021310776.1  0.0      B3 domain-containing protein Os01g0905400 isoform X3
Swissprot  Q5N6V0          0.0      Y1054_ORYSJ; B3 domain-containing protein Os01g0905400
TrEMBL     A0A1W0W108      0.0      A0A1W0W108_SORBI; Uncharacterized protein
STRING     Sb03g043110.1   0.0      (Sorghum bicolor)
Orthologous Group
Lineage               Orthologous Group ID  Taxa Number  Gene Number
Monocots              OGMP114613240
Representative plant  OGRP89721014
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G24700.1  7e-09    B3 family protein
Publications
  1. Kikuchi S, et al.
    Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice.
    Science, 2003. 301(5631): p. 376-9
    [PMID:12869764]