PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cagra.1150s0033.1.p
Organism Capsella grandiflora
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family B3
Protein Properties Length: 864aa    MW: 98142.7 Da    PI: 10.0373
Description B3 family protein
Gene Model
Gene Model ID        Type    Source
Cagra.1150s0033.1.p  genome  JGI
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    B3      48.4   1.7e-15  14     99   1          95
                         EEEE-..-HHHHTT-EE--HHH.HTT.---..--SEEEEEE.TTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE CS
                   B3  1 ffkvltpsdvlksgrlvlpkkfaeeh.ggkkeesktltled.esgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsef 90
                         ffk l     ++++ l +p  f+++h +g + ++kt +l++  s ++W vk+  ++ +    lt+GW+eF+ a++L+ gD+vvF+++g++ f
  Cagra.1150s0033.1.p 14 FFKPL--LPGFRTH-LNIPVAFFSKHvEGRNDQNKTARLRSdASDETWLVKM--DGLK----LTDGWEEFAFAHDLRIGDIVVFRHEGEMVF 96
                         67777..6677777.99**********77778999*****97889*******..6666....***********************9998888 PP

                         ..EEE CS
                   B3 91 elvvk 95
                           +v+
  Cagra.1150s0033.1.p 97 --HVT 99
                         ..665 PP

2    B3      58.6   1.1e-18  173    259  11         99
                          HHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
                   B3  11 lksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99 
                          l+++++ +p +fa+ +g++k ++k++ l +e+grsW++ l ++k + ++++++GW++F+ ang+ +g   +F+l++ +++ +v++++r+
  Cagra.1150s0033.1.p 173 LRKNLFNIPLTFARLNGLNKMRGKKIYLHNEEGRSWKLGLVHDKAGMHTYFKSGWRSFCTANGISQGRY-TFQLVR-KSAPPVIRLCRS 259
                          678999*************************************************************96.***998.8999******97 PP

3    B3      67.4   2e-21    274    367  2          98
                          EEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE. CS
                   B3   2 fkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefe 91 
                          fk  +++++l+ ++l+lp++f++++g++k+ + +++l++e+g +W + l+   k+ +++ltkGW +F++ ng+k+gDf+ Fkl g + +e
  Cagra.1150s0033.1.p 274 FKRHVSPSSLRYDQLYLPRSFVSSNGLDKRFG-EIILKNEQGCKWPLVLK-HSKPITTYLTKGWTNFCHVNGIKVGDFFKFKLAG-TWEE 360
                          6888999*********************9887.****************4.45556699***********************998.7778 PP

                          .EEEEE- CS
                   B3  92 lvvkvfr 98 
                          +v++++ 
  Cagra.1150s0033.1.p 361 PVLSLCP 367
                          8998886 PP

4    B3      53.2   5.4e-17  488    569  6          88
                          ..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSS CS
                   B3   6 tpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrs 88 
                          +++++l  +rl+lp  f +++g+ k ++  ++l d +grsW+++l+y+  + +++++ GW +F++ang+ +g  ++Fkl++ +
  Cagra.1150s0033.1.p 488 VTASSLIYDRLYLPLIFERSNGLHKMSGERIVLLDGEGRSWNLNLKYNEAGMHTYIRPGWTRFCDANGMSQGQKFTFKLVQ-K 569
                          667777889*********************************************************************987.3 PP

5    B3      68.6   8.8e-22  599    688  6          97
                          ..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEE CS
                   B3   6 tpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvk 95 
                          +++++l+++rl+l+++f+++ g++k    +++le+e+g+ W++ l++ k+ ++++l++GW  F+++ngLk gD++ Fkl+g +++++v++
  Cagra.1150s0033.1.p 599 VTASSLTTDRLCLSRSFVRSSGLDKGY-EEIVLENEWGKGWNLVLKHYKSCCSTILGGGWTTFCQDNGLKPGDSFKFKLVG-TGERPVLS 686
                          67899999*************999865.6**************************************************99.66667777 PP

                          EE CS
                   B3  96 vf 97 
                          +f
  Cagra.1150s0033.1.p 687 LF 688
                          66 PP

6    B3      41.5   2.5e-13  774    859  16         99
                          EE--HHH.HTT....---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
                   B3  16 lvlpkkfaeeh....ggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99 
                          l+l  + ++++    g++k++  ++tl  ++g +  v l  ++  gr+ ++kGW+eF+ka g+k  +++v++l++++e+++v+k++ k
  Cagra.1150s0033.1.p 774 LTLTQSAFQTYklmnGINKTG--KITLLGQDGVKRVVDLFLDRICGRMRFGKGWREFCKAEGVKIDESFVLELIWEEEARPVFKFCTK 859
                          466666666667775666555..8********999999988*******************************************9975 PP
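The Start and End values reported for each B3 hit are 1-based, inclusive positions on the protein, so the span of a hit is End - Start + 1. A minimal Python sketch of that arithmetic, assuming the six coordinate pairs are transcribed from the hits above (the names `domains` and `span` are illustrative and not part of PlantTFDB):

```python
# Start/End coordinates of the six B3 hits, transcribed from the alignments above
# (1-based, inclusive positions on Cagra.1150s0033.1.p).
domains = [(14, 99), (173, 259), (274, 367), (488, 569), (599, 688), (774, 859)]


def span(start, end):
    """Number of residues covered by an inclusive 1-based interval."""
    return end - start + 1


spans = [span(s, e) for s, e in domains]
total_covered = sum(spans)

for i, ((s, e), n) in enumerate(zip(domains, spans), start=1):
    print(f"B3 domain {i}: {s}-{e} ({n} residues)")
print(f"Total residues in B3 hits: {total_covered}")
```

The hits do not overlap, so the total is a plain sum of the individual spans.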

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:2.40.330.10  1.8E-21   8      106  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          5.1E-19   9      111  IPR015300    DNA-binding pseudobarrel domain
PROSITE profile  PS50863            11.632    11     104  IPR003340    B3 DNA binding domain
CDD              cd10017            1.68E-14  12     99   No hit       No description
SMART            SM01019            5.1E-21   14     104  IPR003340    B3 DNA binding domain
Pfam             PF02362            2.5E-9    21     96   IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            11.589    163    260  IPR003340    B3 DNA binding domain
SMART            SM01019            6.5E-19   165    260  IPR003340    B3 DNA binding domain
SuperFamily      SSF101936          2.16E-13  167    259  IPR015300    DNA-binding pseudobarrel domain
Gene3D           G3DSA:2.40.330.10  3.4E-14   168    259  IPR015300    DNA-binding pseudobarrel domain
Pfam             PF02362            1.8E-16   172    259  IPR003340    B3 DNA binding domain
CDD              cd10017            1.78E-10  173    258  No hit       No description
Gene3D           G3DSA:2.40.330.10  7.9E-22   266    367  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          5.1E-20   269    371  IPR015300    DNA-binding pseudobarrel domain
PROSITE profile  PS50863            14.24     273    368  IPR003340    B3 DNA binding domain
CDD              cd10017            1.14E-17  273    354  No hit       No description
SMART            SM01019            6.0E-26   273    369  IPR003340    B3 DNA binding domain
Pfam             PF02362            2.6E-20   274    367  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  4.6E-17   476    570  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          2.35E-16  477    572  IPR015300    DNA-binding pseudobarrel domain
PROSITE profile  PS50863            13.592    483    581  IPR003340    B3 DNA binding domain
CDD              cd10017            7.32E-13  483    569  No hit       No description
SMART            SM01019            5.8E-19   484    581  IPR003340    B3 DNA binding domain
Pfam             PF02362            3.2E-16   487    574  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  8.5E-19   587    685  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          1.29E-19  590    688  IPR015300    DNA-binding pseudobarrel domain
PROSITE profile  PS50863            14.029    594    693  IPR003340    B3 DNA binding domain
SMART            SM01019            5.9E-24   595    691  IPR003340    B3 DNA binding domain
CDD              cd10017            4.70E-18  599    688  No hit       No description
Pfam             PF02362            1.3E-20   599    688  IPR003340    B3 DNA binding domain
SMART            SM01019            2.4E-9    759    860  IPR003340    B3 DNA binding domain
SuperFamily      SSF101936          1.8E-11   771    859  IPR015300    DNA-binding pseudobarrel domain
Gene3D           G3DSA:2.40.330.10  8.7E-10   773    858  IPR015300    DNA-binding pseudobarrel domain
Pfam             PF02362            6.9E-12   784    859  IPR003340    B3 DNA binding domain
CDD              cd10017            1.30E-7   789    857  No hit       No description
PROSITE profile  PS50863            7.486     795    860  IPR003340    B3 DNA binding domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0009507  Cellular Component  chloroplast
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 864 aa
MADQSLRSPT KPHFFKPLLP GFRTHLNIPV AFFSKHVEGR NDQNKTARLR SDASDETWLV  60
KMDGLKLTDG WEEFAFAHDL RIGDIVVFRH EGEMVFHVTA LGPSCCEIKY TSSLSHNMIN  120
DDQTNIVSRN SSRVKKHTRK KVESSSNHSR FVANVTAWGL SNDRLVRLVP CNLRKNLFNI  180
PLTFARLNGL NKMRGKKIYL HNEEGRSWKL GLVHDKAGMH TYFKSGWRSF CTANGISQGR  240
YTFQLVRKSA PPVIRLCRSN ERSAAESASD HSCFKRHVSP SSLRYDQLYL PRSFVSSNGL  300
DKRFGEIILK NEQGCKWPLV LKHSKPITTY LTKGWTNFCH VNGIKVGDFF KFKLAGTWEE  360
PVLSLCPAES NRDKTPLKCS EISNDVNPEE SEEETTGDKN ISRHYLDLKK RKYRSRCRAS  420
VENMDDDQTN IGKLLLYEMY SRLSRKDAKM RFSLFSITGN SSRVKRVKKN PRKKVESSSD  480
HSSFVANVTA SSLIYDRLYL PLIFERSNGL HKMSGERIVL LDGEGRSWNL NLKYNEAGMH  540
TYIRPGWTRF CDANGMSQGQ KFTFKLVQKA APPIMRLHLA KRRLISESSS HHSYLVGSVT  600
ASSLTTDRLC LSRSFVRSSG LDKGYEEIVL ENEWGKGWNL VLKHYKSCCS TILGGGWTTF  660
CQDNGLKPGD SFKFKLVGTG ERPVLSLFLA DSNHVSNHEK TPLECPEGSD DVKYLSSNSS  720
SGDDSSKSNE SGNESIDGRN KNNSQYSGEI KKRKYFWKCR ASSPSYTQDR FVTLTLTQSA  780
FQTYKLMNGI NKTGKITLLG QDGVKRVVDL FLDRICGRMR FGKGWREFCK AEGVKIDESF  840
VLELIWEEEA RPVFKFCTKV NSA*
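The sequence block above is formatted in groups of ten residues with a running position counter at the end of each full line, and it ends with a `*`, which conventionally marks the stop in a translated sequence. The sketch below, in Python, shows one way to recover a plain residue string from such a block; the helper name `clean_sequence` is hypothetical, and only the last two lines of the block are used as an excerpt. The counters imply 840 residues before the final line, whose 24 characters include the `*`, so the reported length of 864 appears to count that terminal symbol.

```python
import re


def clean_sequence(block: str) -> str:
    """Drop whitespace and running position counters from a formatted sequence block."""
    return re.sub(r"[\s\d]", "", block)


# Last two lines of the block above, copied verbatim as a short excerpt.
excerpt = """\
FQTYKLMNGI NKTGKITLLG QDGVKRVVDL FLDRICGRMR FGKGWREFCK AEGVKIDESF  840
VLELIWEEEA RPVFKFCTKV NSA*
"""

seq = clean_sequence(excerpt)
residues = seq.rstrip("*")  # the trailing "*" is a stop marker, not an amino acid
print(len(seq), len(residues))
```

Applying the same function to the full block gives a single string suitable for writing out as FASTA.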
Cis-element
Source       Link
PlantRegMap  Cagra.1150s0033.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID               E-value  Description
Refseq     XP_023633067.1       0.0      B3 domain-containing protein REM17
Swissprot  Q84WP3               0.0      REM17_ARATH; B3 domain-containing protein REM17
TrEMBL     R0GPB9               0.0      R0GPB9_9BRAS; Uncharacterized protein
STRING     Cagra.1150s0033.1.p  0.0      (Capsella grandiflora)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM5781              16           48
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G26680.1  0.0      B3 family protein
Publications
  1. Mantegazza O, et al.
    Analysis of the arabidopsis REM gene family predicts functions during flower development.
    Ann. Bot., 2014. 114(7): p. 1507-15
    [PMID:25002525]