PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.004G204000.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family B3
Protein Properties Length: 1025aa    MW: 114887 Da    PI: 8.8521
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.004G204000.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B330.65.9e-103284111099
                           HHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE- CS
                    B3  10 vlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfr 98 
                           +++++ lv+p + a + ++ +++   + l+d++g+  +v +  +k  ++ v+ +GW  Fv+ + +k g+f++F+++ +s+f   v+vfr
  Sobic.004G204000.1.p 328 NNSEKFLVIPPTVAPRLEYLTNQ--LVYLKDSEGKCSKVLV--SKVAETLVFHQGWDIFVSNHLIKWGEFLLFEYIAESTF--SVRVFR 410
                           4566679*******999888544..8***************..*********************************98999..999998 PP

                           S CS
                    B3  99 k 99 
                           +
  Sobic.004G204000.1.p 411 T 411
                           6 PP

2B3314.3e-1096010183596
                            EEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS.SEE..EEEE CS
                    B3   35 tltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr.sefelvvkv 96  
                             ++l+d+  r W + +    ++  + +t+GWk+Fv an+L++gD++ +    + ++ e+v +v
  Sobic.004G204000.1.p  960 VVMLKDPMKRLWPIIY--HDNPIFVGFTAGWKHFVAANNLQTGDVCELIK--EsEDDEPVYSV 1018
                            5899************..****************************8883..33666666555 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019361.14E-14317414IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.101.4E-14317413IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086310.362319412IPR003340B3 DNA binding domain
SMARTSM010194.0E-4322412IPR003340B3 DNA binding domain
CDDcd100171.10E-7322410No hitNo description
PfamPF023623.9E-8327411IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.104.2E-119251017IPR015300DNA-binding pseudobarrel domain
SMARTSM010192.7E-49301024IPR003340B3 DNA binding domain
CDDcd100175.58E-99361019No hitNo description
SuperFamilySSF1019363.53E-109381011IPR015300DNA-binding pseudobarrel domain
PfamPF023622.5E-79601017IPR003340B3 DNA binding domain
PROSITE profilePS508638.989611022IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1025 aa     Download sequence    Send to blast
MIGSAPDSAR RKPSHDRAND EVRHGDTVVK NTKRSADAFT EKRKKKDHLY NANHGKDRDI  60
RNGDQRIKKS RVPAASSEKG REKGDHDLNK TSSKKKMMGV DLDKKRTVTS RRKKLERERK  120
KKLLINASIR KNMQRDDGEE GRIMSNYYET KVKSKKVSTT LSDKERNKEK LNKTHREKKM  180
QAADSKMRNH DSGVNKGKVS TTFCDKEKKR KRPSNTNSEK ETAPITYAVK EKKMRTTESV  240
EIKMRHDRQN RRNVSLDVSN EKMDTSSGSN YKIRKRKLAH TLLKEKKRMR YNDSDKIHSG  300
RVKEMEKISG GKEKNNQAPI AFLKFIRNNS EKFLVIPPTV APRLEYLTNQ LVYLKDSEGK  360
CSKVLVSKVA ETLVFHQGWD IFVSNHLIKW GEFLLFEYIA ESTFSVRVFR TDSCERVDFN  420
PESTNKGGRK KQAWSNMPPD DLVITDGSSQ NIDDGYYVSG ECPRTKVPQT CHVTCNTKND  480
PKQVEHVVGS GVMAQDNNGK SIDPQCKTKG TSPLCSKGKT LITLIDSEDS EPLEHENGDT  540
MKLATSVADS DTSLVAVNTN EGPIRAQSGI GNGPSVVLGD EKGSSPEIEC GTKSISTTCS  600
EGKTRSQIII TSTALLDLHD SDEDLGRKQR TNVVPLDSIT PVIDYHNHSK TDIIQNLYRK  660
YEAPGGFRCL EKWRKDVVNN QASLDCTVPI KPENPQKNDS MLVDGYGSIE LNPVDEYICS  720
EGNHECVQPL FTMPIKEPSS ADRVTNCGHD GTEIDYSINE KDGGASVLLE AKGERLEPMG  780
SIVHSQSNNA PLCANPVVPG KDGATGLTPI GSEGSCAFIE SMFIGPIEKT SSPDEISKCS  840
SSMIEIEHNV NEKGTPVQFE TQMDQVEPVR SSVRSKSRNI VVRANESEHC FSKQEGRMPS  900
NTEVPEPLLP MKDKILELDY HSPPEINSQL CIPDTTQKWL GLSKSLSSAV IRQRRHHWDV  960
VMLKDPMKRL WPIIYHDNPI FVGFTAGWKH FVAANNLQTG DVCELIKESE DDEPVYSVRM  1020
CGKI*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1103121KKRTVTSRRKKLERERKKK
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.004G204000.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021315763.10.0B3 domain-containing protein Os02g0598200
TrEMBLA0A194YQQ00.0A0A194YQQ0_SORBI; Uncharacterized protein
STRINGGRMZM5G834874_P010.0(Zea mays)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP122722730
Representative plantOGRP1458744
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18960.18e-11B3 family protein