PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sphfalx0105s0045.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Bryophyta; Sphagnophytina; Sphagnopsida; Sphagnales; Sphagnaceae; Sphagnum
Family B3
Protein Properties Length: 842aa    MW: 93585.6 Da    PI: 6.0625
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sphfalx0105s0045.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B344.92.1e-145235853397
                           -SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE CS
                    B3  33 sktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvf 97 
                           +++ +l+d+ g  W+vk  ++ +++   l++GW++F  +++L+egD++vF+ ++++e +l+v++f
  Sphfalx0105s0045.1.p 523 TTDAILYDTAGGAWQVKWLVNSSGR--RLSAGWRKFSVDHQLDEGDVCVFEIVKKDELSLLVHIF 585
                           56899*************5555555..59**********************99999999***998 PP

2B330.94.7e-10681769496
                           E-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE.. CS
                    B3   4 vltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefel 92 
                           ++ +s+v ++  +++p  fa++   +    + +tl d+ g+s++ + i ++k+++++  k  ++F  ++ L+egD ++F+l++ + ++l
  Sphfalx0105s0045.1.p 681 IMRKSSVYRHFDVSIPCMFAKRWL-P-SALMRVTLVDSAGQSFTARWIGNRKKSQCL--KSFRDFSISHCLEEGDACIFELMDPNPGTL 765
                           67899***************8884.3.35689***************8877777764..48********************98899989 PP

                           EEEE CS
                    B3  93 vvkv 96 
                           v+kv
  Sphfalx0105s0045.1.p 766 VFKV 769
                           8887 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:2.40.330.104.1E-1136131IPR015300DNA-binding pseudobarrel domain
CDDcd100172.43E-737130No hitNo description
SuperFamilySSF1019367.26E-1337137IPR015300DNA-binding pseudobarrel domain
SMARTSM010190.01439132IPR003340B3 DNA binding domain
PROSITE profilePS508637.66940132IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.4E-13494582IPR015300DNA-binding pseudobarrel domain
CDDcd100171.44E-13495586No hitNo description
SuperFamilySSF1019362.16E-14496585IPR015300DNA-binding pseudobarrel domain
SMARTSM010195.2E-6497588IPR003340B3 DNA binding domain
PfamPF023622.0E-11523585IPR003340B3 DNA binding domain
PROSITE profilePS5086311.561527588IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.104.0E-14670773IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019367.06E-15671773IPR015300DNA-binding pseudobarrel domain
CDDcd100177.50E-15676773No hitNo description
SMARTSM010190.01678775IPR003340B3 DNA binding domain
PROSITE profilePS508637.711678775IPR003340B3 DNA binding domain
PfamPF023623.2E-9681769IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 842 aa     Download sequence    Send to blast
MESSAFDDMA FRACAICRRS CFERHGNDTN EAALGSPSFL KLLGDDDICN SLSSLHIPSM  60
FMASNDCRIW DTVTLKGPTG FMGEAVVAHD HGYSFQGGWK EFVKHQNIES GDTVLCTLIA  120
DSVFAIKFYD KLGCEKTFQS GGGVHAESAA GKQTDANLQG TSGETVIPPH SNAPADRMLS  180
CEENRINETE QGCEEALMTA APAELGVQHR KRPSELKSPD QQRKKRPCIE QLVLGVAERG  240
IEEQRKRLQK LLRQSRMRMK YDGKFGNSEY EWEEEEDQVC DLSAAPVRLE GFPTAQVLES  300
EESAGYEELC TVQQGRDYGS HMSSQAPESE SLCAMECVQA SDARVVRKDS SPNSEVHIMN  360
KMSSSRHIMK DATGTITMEQ RGIGWRDSAS QSQNFVGQGQ TPYFLAEKLL TSYCGTSCKN  420
SGDGAAHLQK GILVELSDDE SDEKIEPTSI AGEGHPLLQQ RVKIREVPLF RSRPPTKAER  480
QAAIDLAHEV SIINPHTRIV MKHSQVCAGF SVNLGLAGMP NVTTDAILYD TAGGAWQVKW  540
LVNSSGRRLS AGWRKFSVDH QLDEGDVCVF EIVKKDELSL LVHIFPIVIL EDQPGTESPN  600
TIMGGEMKQP VTPAVVPLDP SATSSFLGNA DWADDGVLKG LPFNYSWSLS SRRHPITKQE  660
RRHTEATARA FETRNPSTTV IMRKSSVYRH FDVSIPCMFA KRWLPSALMR VTLVDSAGQS  720
FTARWIGNRK KSQCLKSFRD FSISHCLEEG DACIFELMDP NPGTLVFKVH IFRVVELKSG  780
QCGPDDWQKH YHLLTGRGFI QEAENPFRMD LEVVPVQQQE ATDPLPLSHR VERERVFQRQ  840
I*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1209225RKRPSELKSPDQQRKKR
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_024381077.18e-99uncharacterized protein LOC112284923
RefseqXP_024381078.18e-99uncharacterized protein LOC112284923
RefseqXP_024381079.18e-99uncharacterized protein LOC112284923
TrEMBLA0A2K1KD752e-97A0A2K1KD75_PHYPA; Uncharacterized protein
STRINGPP1S97_269V6.13e-98(Physcomitrella patens)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18990.15e-12B3 family protein