PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Tp5g23120
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Brassicaceae incertae sedis; Schrenkiella
Family BBR-BPC
Protein Properties Length: 283aa    MW: 31768.1 Da    PI: 9.7414
Description BBR-BPC family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Tp5g23120genomethellungiellaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GAGA_bind307.74.1e-9422831301
  GAGA_bind   1 mdddgsrernkgyyepaaslkenlglqlmssiaerdaki..rernlalsekkaavaerd......................maflqrdkalaernkalve 76 
                md+dg  +rn+gyyep+ ++k nlg+ql+ si +r++k+   ++n +++++++ +                          m++    ++l++      +
  Tp5g23120   2 MDEDGLSNRNWGYYEPS-QFKPNLGFQLIPSIVDRNEKPflCTQNPNFITPNNGYRGSSssssssnvmsfardysvsdapfMSY----SWLNQ------H 90 
                9***************9.*********************997777778888888855553344444433332244444444444....55555......6 PP

  GAGA_bind  77 rdnkllalllvenslasalpvgvqvlsgtksidslqqlsepqledsavelreeeklealpieeaaeeakekkkkkkrqrakkpkekkakkkkkksekskk 176
                rd+k+++   +  +      + +  +++ + ++ +q     q                +p + +++e+ ++++++ +   + pk+kk++k k+       
  Tp5g23120  91 RDSKFFNSSINPSN------HHSLLVPDNSRTHPMQL---LQ--------------IPKPEVGEVDESLKRTQCSGG-DRAGPKAKKERKLKD------- 159
                66666666553332......12334555555555554...11..............122223456777888888855.456789999999888....... PP

  GAGA_bind 177 kvkkesaderskaekksidlvlngvslDestlPvPvCsCtGalrqCYkWGnGGWqSaCCtttiSvyPLPvstkrrgaRiagrKmSqgafkklLekLaaeG 276
                  + +++++++   +ksi++v+ngvs+D++ lPvPvCsCtG+++ CY+WG+GGWqSaCCtt++S+yPLP+stkrrgaRiagrKmSqgaf+k+LekL+a+G
  Tp5g23120 160 -CNVPRVQRERSPLRKSIEMVINGVSMDIGCLPVPVCSCTGMPQPCYRWGCGGWQSACCTTNVSMYPLPMSTKRRGARIAGRKMSQGAFRKVLEKLSADG 258
                .4455555566789************************************************************************************** PP

  GAGA_bind 277 ydlsnpvDLkdhWAkHGtnkfvtir 301
                +d+snp+DLk+hWAkHGtnkfvtir
  Tp5g23120 259 FDFSNPIDLKSHWAKHGTNKFVTIR 283
                ************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM012264.6E-1372283IPR010409GAGA-binding transcriptional activator
PfamPF062177.9E-852283IPR010409GAGA-binding transcriptional activator
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 283 aa     Download sequence    Send to blast
MMDEDGLSNR NWGYYEPSQF KPNLGFQLIP SIVDRNEKPF LCTQNPNFIT PNNGYRGSSS  60
SSSSSNVMSF ARDYSVSDAP FMSYSWLNQH RDSKFFNSSI NPSNHHSLLV PDNSRTHPMQ  120
LLQIPKPEVG EVDESLKRTQ CSGGDRAGPK AKKERKLKDC NVPRVQRERS PLRKSIEMVI  180
NGVSMDIGCL PVPVCSCTGM PQPCYRWGCG GWQSACCTTN VSMYPLPMST KRRGARIAGR  240
KMSQGAFRKV LEKLSADGFD FSNPIDLKSH WAKHGTNKFV TIR
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1133138ESLKRT
Functional Description ? help Back to Top
Source Description
UniProtTranscriptional regulator that specifically binds to GA-rich elements (GAGA-repeats) present in regulatory sequences of genes involved in developmental processes. {ECO:0000269|PubMed:14731261}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapTp5g23120
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0125631e-133AC012563.7 Arabidopsis thaliana chromosome 1 BAC T23K23 genomic sequence, complete sequence.
GenBankBT0221171e-133BT022117.1 Arabidopsis thaliana At1g68120 mRNA, complete cds.
GenBankCP0026841e-133CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006391183.10.0protein BASIC PENTACYSTEINE3
SwissprotQ9C9X61e-147BPC3_ARATH; Protein BASIC PENTACYSTEINE3
TrEMBLV4KBG90.0V4KBG9_EUTSA; Uncharacterized protein
STRINGXP_006391183.10.0(Eutrema salsugineum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM153471518
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G68120.11e-148basic pentacysteine 3