PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thhalv10000111m
Common Name EUTSA_v10000111mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema
Family B3
Protein Properties Length: 550 aa    MW: 62704.2 Da    pI: 7.8402
Description B3 family protein
Gene Model
Gene Model ID     Type     Source   Coding Sequence
Thhalv10000111m   genome   JGI      View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    B3      47     4.5e-15  6      94   1          96
                     EEEE-..-HHHHTT-EE--HHH.HTT..---..--SEEEEEE.TTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..E CS
               B3  1 ffkvltpsdvlksgrlvlpkkfaeeh..ggkkeesktltled.esgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelv 93
                     ffk l     + s+ l++p  f+ ++  g ++++++t +l++  s  +Wevk+  + + +   lt+GWkeFv a++L++gD+ +F+ ++++ f  +
  Thhalv10000111m  6 FFKPL--LPGFHSH-LTIPVAFFLKYikGTNEQKKTTAKLRSdASDITWEVKI--EDG-Q--KLTDGWKEFVLAHDLRVGDIAIFRQEKDMAF--H 91
                     66666..5667777.************9888889999***997888*******..333.2..4***********************7776777..6 PP

                     EEE CS
               B3 94 vkv 96
                     v++
  Thhalv10000111m 92 VTL 94
                     665 PP

2    B3      67.8   1.6e-21  143    226  4          89
                      E-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSE CS
               B3   4 vltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrse 89 
                      + +++ +l+ + l lp +fa+++g++++++ +++l++ +grsW++ l  +kk g++++++GW+ F++ang k+g+ ++Fkl++r +
  Thhalv10000111m 143 AHVTHASLRYDSLNLPMSFARANGLNTRCG-EIVLMNDKGRSWTLAL-KQKKCGSTYIRRGWRTFCSANGFKAGEAFTFKLIQRGK 226
                      556788899999**************9987.****************.47777789*************************99633 PP

3    B3      81.9   6.2e-26  293    382  4          95
                      E-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEE CS
               B3   4 vltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvk 95 
                      + +++++l+++rl+lp++f +e+g++++++ +++l++e grsW+++l+ +++ g+ ++++GW++F++angL++gD+++Fkl++r +++lv++
  Thhalv10000111m 293 AKVSPSTLRQDRLYLPRNFSRENGLDTRCG-EIVLMNEMGRSWTLNLKRKNSCGTAYIRRGWRSFCRANGLRAGDSITFKLIQR-GGTLVLR 382
                      56789999******************9987.****************98888888***************************95.5556665 PP

4    B3      53.9   3.2e-17  451    543  5          99
                      -..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE- CS
               B3   5 ltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfr 98 
                      ++++ +l++  + lp +f++ hg+++e  ++++l d++g +W+ +l+ + +++r+ + +GW+eF kan +k g++v++kl+++ ++++v+k+++
  Thhalv10000111m 451 TLKPFNLTKYVMLLPIPFTRMHGINEE--TKMSLVDKHGVRWSTNLRSEITGDRIRMVGGWQEFFKANCVKIGESVMLKLIWEGDKSCVLKFCS 542
                      5556667777899**********9954..58****************************************************9999***9998 PP

                      S CS
               B3  99 k 99 
                      k
  Thhalv10000111m 543 K 543
                      6 PP

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
SuperFamily      SSF101936          8.83E-19  3      105  IPR015300    DNA-binding pseudobarrel domain
Gene3D           G3DSA:2.40.330.10  6.2E-22   3      111  IPR015300    DNA-binding pseudobarrel domain
PROSITE profile  PS50863            12.012    3      98   IPR003340    B3 DNA binding domain
CDD              cd10017            7.51E-14  4      94   No hit       No description
SMART            SM01019            3.0E-19   6      98   IPR003340    B3 DNA binding domain
Pfam             PF02362            3.2E-10   14     91   IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  4.8E-22   133    232  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          1.14E-18  135    232  IPR015300    DNA-binding pseudobarrel domain
SMART            SM01019            9.9E-24   140    236  IPR003340    B3 DNA binding domain
CDD              cd10017            5.15E-18  140    232  No hit       No description
PROSITE profile  PS50863            15.326    140    235  IPR003340    B3 DNA binding domain
Pfam             PF02362            1.9E-20   143    229  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  6.5E-24   286    383  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          3.53E-24  288    388  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            1.14E-26  290    383  No hit       No description
PROSITE profile  PS50863            18.034    290    387  IPR003340    B3 DNA binding domain
Pfam             PF02362            1.9E-24   291    382  IPR003340    B3 DNA binding domain
SMART            SM01019            4.3E-34   291    387  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  4.8E-17   439    543  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          7.65E-19  443    541  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            1.17E-14  446    541  No hit       No description
Pfam             PF02362            2.4E-15   447    543  IPR003340    B3 DNA binding domain
SMART            SM01019            1.5E-18   447    544  IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            12.746    447    543  IPR003340    B3 DNA binding domain
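
The feature hits listed above come from six databases but appear to cluster around the same four stretches of the protein. As a rough illustration, the sketch below merges overlapping hit intervals into consensus regions. The coordinates are transcribed from the table as read here, and the names HITS and merge_regions are hypothetical helpers, not part of PlantTFDB.

```python
# (database, start, end) for each hit, transcribed from the Protein Features table.
# Assumption: the flattened rows were split into columns correctly.
HITS = [
    ("SuperFamily", 3, 105), ("Gene3D", 3, 111), ("PROSITE profile", 3, 98),
    ("CDD", 4, 94), ("SMART", 6, 98), ("Pfam", 14, 91),
    ("Gene3D", 133, 232), ("SuperFamily", 135, 232), ("SMART", 140, 236),
    ("CDD", 140, 232), ("PROSITE profile", 140, 235), ("Pfam", 143, 229),
    ("Gene3D", 286, 383), ("SuperFamily", 288, 388), ("CDD", 290, 383),
    ("PROSITE profile", 290, 387), ("Pfam", 291, 382), ("SMART", 291, 387),
    ("Gene3D", 439, 543), ("SuperFamily", 443, 541), ("CDD", 446, 541),
    ("Pfam", 447, 543), ("SMART", 447, 544), ("PROSITE profile", 447, 543),
]


def merge_regions(hits):
    """Merge overlapping (start, end) intervals; return (start, end, hit_count) tuples."""
    regions = []
    for _, start, end in sorted(hits, key=lambda h: h[1]):
        if regions and start <= regions[-1][1]:
            # Hit overlaps the current region: extend it and count the hit.
            regions[-1][1] = max(regions[-1][1], end)
            regions[-1][2] += 1
        else:
            # Hit starts past the current region: open a new one.
            regions.append([start, end, 1])
    return [tuple(r) for r in regions]


REGIONS = merge_regions(HITS)
for start, end, count in REGIONS:
    print(f"{start}-{end}: {count} hits")
```

Under these assumptions the 24 hits collapse into four non-overlapping regions, each supported by all six databases, which is consistent with the four B3 domains in the Signature Domain section.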
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0009793  Biological Process  embryo development ending in seed dormancy
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 550 aa
MANKHFFKPL LPGFHSHLTI PVAFFLKYIK GTNEQKKTTA KLRSDASDIT WEVKIEDGQK  60
LTDGWKEFVL AHDLRVGDIA IFRQEKDMAF HVTLMGPSCC EIQYESGLDD ENNLGKIPKK  120
TKVKKNPRKE TESSSLDPSC FVAHVTHASL RYDSLNLPMS FARANGLNTR CGEIVLMNDK  180
GRSWTLALKQ KKCGSTYIRR GWRTFCSANG FKAGEAFTFK LIQRGKVPVL RLSTTESEEE  240
EESSGADEVE SLSTEPESDE ESNLAEIQRR KKLKKNPERE TESYPLDPSY FVAKVSPSTL  300
RQDRLYLPRN FSRENGLDTR CGEIVLMNEM GRSWTLNLKR KNSCGTAYIR RGWRSFCRAN  360
GLRAGDSITF KLIQRGGTLV LRLSPTDSEE EEEEEEEEEE EESSEGDEIE SLSTEQESDE  420
EGNQDEKSFK KPRLLWKASS TPSQNRFVTL TLKPFNLTKY VMLLPIPFTR MHGINEETKM  480
SLVDKHGVRW STNLRSEITG DRIRMVGGWQ EFFKANCVKI GESVMLKLIW EGDKSCVLKF  540
CSKLKQVTK*
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    267    274  QRRKKLKK
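
The NLS coordinates can be checked against the protein sequence shown in the Sequence section. The sketch below strips the display formatting (spaces, running position counts, the terminal `*`) and slices the sequence under two possible readings of Start 267 / End 274. The helper name clean_sequence is hypothetical, and which indexing convention the database intends is an assumption being tested, not something the page states.

```python
import re

# Protein sequence as displayed: 10-residue blocks, running counts, '*' for the stop.
RAW = """
MANKHFFKPL LPGFHSHLTI PVAFFLKYIK GTNEQKKTTA KLRSDASDIT WEVKIEDGQK  60
LTDGWKEFVL AHDLRVGDIA IFRQEKDMAF HVTLMGPSCC EIQYESGLDD ENNLGKIPKK  120
TKVKKNPRKE TESSSLDPSC FVAHVTHASL RYDSLNLPMS FARANGLNTR CGEIVLMNDK  180
GRSWTLALKQ KKCGSTYIRR GWRTFCSANG FKAGEAFTFK LIQRGKVPVL RLSTTESEEE  240
EESSGADEVE SLSTEPESDE ESNLAEIQRR KKLKKNPERE TESYPLDPSY FVAKVSPSTL  300
RQDRLYLPRN FSRENGLDTR CGEIVLMNEM GRSWTLNLKR KNSCGTAYIR RGWRSFCRAN  360
GLRAGDSITF KLIQRGGTLV LRLSPTDSEE EEEEEEEEEE EESSEGDEIE SLSTEQESDE  420
EGNQDEKSFK KPRLLWKASS TPSQNRFVTL TLKPFNLTKY VMLLPIPFTR MHGINEETKM  480
SLVDKHGVRW STNLRSEITG DRIRMVGGWQ EFFKANCVKI GESVMLKLIW EGDKSCVLKF  540
CSKLKQVTK*
"""


def clean_sequence(raw: str) -> str:
    """Keep letters only: drops whitespace, position counts, and the stop symbol."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


seq = clean_sequence(RAW)
nls_start, nls_end, nls_seq = 267, 274, "QRRKKLKK"  # NLS row as listed

as_one_based = seq[nls_start - 1:nls_end]   # coordinates read as 1-based, inclusive
as_zero_based = seq[nls_start:nls_end + 1]  # coordinates read as 0-based, inclusive

print("residues:", len(seq))
print("1-based reading:", as_one_based, as_one_based == nls_seq)
print("0-based reading:", as_zero_based, as_zero_based == nls_seq)
```

With the sequence as displayed, the cleaned string has 549 residues, so the listed length of 550 appears to include the stop symbol. The NLS motif matches only under the 0-based reading (residues 268 to 275 when counted from 1), whereas the Signature Domain alignments are numbered from 1.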
Cis-element
Source       Link
PlantRegMap  Thhalv10000111m
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_013743891.1  0.0      B3 domain-containing protein REM14
Swissprot  O23076          0.0      REM15_ARATH; Putative B3 domain-containing protein REM15
TrEMBL     V4LRK3          0.0      V4LRK3_EUTSA; Uncharacterized protein
STRING     XP_006405006.1  0.0      (Eutrema salsugineum)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM20422261
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G00260.1  0.0      B3 family protein
Publications
  1. Mantegazza O, et al.
    Analysis of the arabidopsis REM gene family predicts functions during flower development.
    Ann. Bot., 2014. 114(7): p. 1507-15
    [PMID:25002525]