PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Carubv10023579m
Common NameCARUB_v10023579mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family BBR-BPC
Protein Properties Length: 337aa    MW: 37686.9 Da    PI: 9.962
Description BBR-BPC family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Carubv10023579mgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GAGA_bind337.14.6e-103433361301
        GAGA_bind   1 mdddgsre..rnkg.yye................paaslkenlglqlmssiaerdakirernlalsekkaavaerdmaflqrdkalaernkalv 75 
                      m++ g+++  r+k+ y +                ++++l   +++++ms++aerda+++ern+a+s+kk+ava+rd+a++qrdkal+er+kal+
  Carubv10023579m  43 MENGGQYDngRFKPdYLKgaqsmwnmmpqhqikeQHNALV--MNKKIMSILAERDAAVHERNQAVSAKKEAVAARDEALQQRDKALSERDKALI 134
                      8888988888999967779999999998777754446666..99************************************************** PP

        GAGA_bind  76 erdnkllalllvenslasalpvgvqvlsgtksidslqqlsepqledsavelreeeklealpieeaaeeakekkkkkkrqrakkpkekkakkkkk 169
                      erdn+++al+++ensl++a       lsg k +d          +d++++ +e++kl ++p+++ ++e++++k +k+++    + ++ + k kk
  Carubv10023579m 135 ERDNAYAALQHHENSLNFA-------LSGGKRAD----------GDDCFG-TETHKLAVFPLSTIPPEVTNTKVNKRKK----ENKQGQVKLKK 206
                      **************96655.......58999998..........789999.88999************99999888332....33334557888 PP

        GAGA_bind 170 ksekskkkvkkesaderskaekksidlvlngvslDestlPvPvCsCtGalrqCYkWGnGGWqSaCCtttiSvyPLPvstkrrgaRiagrKmSqg 263
                       +e+ +++v++++  ++s+++++s+d++ln v++De+t+PvP+C+CtG+ rqCYkWGnGGWqS+CCttt+S+yPLP+++++r++R++grKmS++
  Carubv10023579m 207 VGEDLNRRVAAPG--KKSRTDWDSQDVGLNLVTFDETTMPVPMCTCTGSARQCYKWGNGGWQSSCCTTTLSQYPLPQMPNKRHSRMGGRKMSGN 298
                      8999*********..57***************************************************************************** PP

        GAGA_bind 264 afkklLekLaaeGydlsnpvDLkdhWAkHGtnkfvtir 301
                      +f++lL++LaaeGydls+pvDLkd+WA+HGtn+++ti+
  Carubv10023579m 299 VFSRLLSRLAAEGYDLSCPVDLKDYWARHGTNRYITIK 336
                      *************************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM012261.2E-16643336IPR010409GAGA-binding transcriptional activator
PfamPF062172.0E-9455336IPR010409GAGA-binding transcriptional activator
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009723Biological Processresponse to ethylene
GO:0050793Biological Processregulation of developmental process
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0042803Molecular Functionprotein homodimerization activity
Sequence ? help Back to Top
Protein Sequence    Length: 337 aa     Download sequence    Send to blast
QTPSLSLSLS ISLSLSLSPI DLSSGSINQY PLFLPQKLHR FRMENGGQYD NGRFKPDYLK  60
GAQSMWNMMP QHQIKEQHNA LVMNKKIMSI LAERDAAVHE RNQAVSAKKE AVAARDEALQ  120
QRDKALSERD KALIERDNAY AALQHHENSL NFALSGGKRA DGDDCFGTET HKLAVFPLST  180
IPPEVTNTKV NKRKKENKQG QVKLKKVGED LNRRVAAPGK KSRTDWDSQD VGLNLVTFDE  240
TTMPVPMCTC TGSARQCYKW GNGGWQSSCC TTTLSQYPLP QMPNKRHSRM GGRKMSGNVF  300
SRLLSRLAAE GYDLSCPVDL KDYWARHGTN RYITIK*
Functional Description ? help Back to Top
Source Description
UniProtTranscriptional regulator that specifically binds to GA-rich elements (GAGA-repeats) present in regulatory sequences of genes involved in developmental processes. {ECO:0000269|PubMed:14731261}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapCarubv10023579m
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAK2290900.0AK229090.1 Arabidopsis thaliana mRNA for hypothetical protein, complete cds, clone: RAFL16-37-J01.
GenBankAK3171650.0AK317165.1 Arabidopsis thaliana AT2G21240 mRNA, complete cds, clone: RAFL21-54-A06.
GenBankAY0845540.0AY084554.1 Arabidopsis thaliana clone 111536 mRNA, complete sequence.
GenBankAY3805700.0AY380570.1 Arabidopsis thaliana basic pentacysteine 4 (BPC4) mRNA, complete cds.
GenBankBT0261090.0BT026109.1 Arabidopsis thaliana At2g21240 mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006294544.10.0protein BASIC PENTACYSTEINE4
RefseqXP_023639537.10.0protein BASIC PENTACYSTEINE4
SwissprotQ8S8C60.0BPC4_ARATH; Protein BASIC PENTACYSTEINE4
TrEMBLR0HTS80.0R0HTS8_9BRAS; Uncharacterized protein (Fragment)
STRINGXP_006294545.10.0(Capsella rubella)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM40902659
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G21240.20.0basic pentacysteine 4