PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_35970_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family B3
Protein Properties Length: 557aa    MW: 63318.7 Da    PI: 9.3294
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_35970_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B350.92.7e-16191102100
                                 EEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE CS
                          B3   2 fkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkl 84 
                                 fkv   +++ + g l +p kfa+++g+k   s  + ++ ++g++We++l   k +g+ + ++GW++F +   L+ g f+vF++
  Cotton_A_35970_BGI-A2_v1.0  19 FKVI-LESTIRDGKLGIPHKFAKDYGSK--LSSPVFFQVPNGEVWELEL--MKLDGKLWVQNGWRKFTEHYSLELGHFLVFRY 96 
                                 5555.356677788***********988..5557***************..******************************** PP

                                 -SSSEE..EEEEE-SS CS
                          B3  85 dgrsefelvvkvfrks 100
                                 +g+ +f  +v +f+ks
  Cotton_A_35970_BGI-A2_v1.0  97 QGNCRF--HVLIFDKS 110
                                 **9999..*****985 PP

2B3521.3e-161862611385
                                 T...T-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE....EEETTE.....EEE-TTHHHHHHHHT--TT-EEEEEE- CS
                          B3  13 s...grlvlpkkfaeehggkkeesktltledesgrsWevkliy..rkksgr.....yvltkGWkeFvkangLkegDfvvFkld 85 
                                 +   ++l++pk+fa+++ +++ +  +++l d+sg++W+ +++        +     + l +GW  Fv++n+L++gD++ F+l+
  Cotton_A_35970_BGI-A2_v1.0 186 HskvSCLSIPKEFARKYLMDH-G--DVILCDSSGNTWSAQYRAtlG----MngqpyVKLLNGWDAFVRDNNLQVGDVCAFELT 261
                                 4445669**********6663.3..6***************66541....22245577889********************85 PP

3B3581.7e-18333430198
                                 EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTE...EEE-TTHHHHHHHHT--TT-EE CS
                          B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgr...yvltkGWkeFvkangLkegDfv 80 
                                 f+ vl ps+v +++ l+++++f+++h + + +++ + l+ ++g+sW  ++    +s +     + +GW+ Fv++n+Lk+gD++
  Cotton_A_35970_BGI-A2_v1.0 333 FVVVLRPSYVQSHA-LCISNDFTRKHFKTTLTNVGIALRLSNGKSWPAEYHQ--RSIGnpnARICNGWRAFVNDNKLKVGDVC 412
                                 67788888888887.***********888899999***************33..333334677889***************** PP

                                 EEEE-SSSEE..EEEEE- CS
                          B3  81 vFkldgrsefelvvkvfr 98 
                                 vF+l+++ + +++v +f+
  Cotton_A_35970_BGI-A2_v1.0 413 VFELVSDAQISFKVIIFQ 430
                                 ****99877777777775 PP

4B340.45.3e-13469551995
                                 HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE. CS
                          B3   9 dvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefe 91 
                                 +  k++++ +p +f+e++ +++ ++  l+l +   r+W v+++ + + +   lt+GW +F+++n L+egD++v +ld  ++  
  Cotton_A_35970_BGI-A2_v1.0 469 HL-KEDCVDIPFRFVEQYFEPNVQK--LIL-QVADRTWPVEITSNPRIRIAKLTSGWIKFARENSLREGDICVYELDTVNNNL 547
                                 44.45559**********7675564..554.4477*******77777777*************************98766655 PP

                                 .EEE CS
                          B3  92 lvvk 95 
                                 l+v+
  Cotton_A_35970_BGI-A2_v1.0 548 LKVS 551
                                 6665 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019369.02E-2511110IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.108.3E-2412110IPR015300DNA-binding pseudobarrel domain
CDDcd100171.35E-1817108No hitNo description
PROSITE profilePS5086313.84617110IPR003340B3 DNA binding domain
SMARTSM010195.6E-1918110IPR003340B3 DNA binding domain
PfamPF023622.1E-1318109IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.107.2E-21166276IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019366.28E-22169275IPR015300DNA-binding pseudobarrel domain
CDDcd100171.18E-19172274No hitNo description
SMARTSM010191.0E-7174276IPR003340B3 DNA binding domain
PROSITE profilePS5086313.606177276IPR003340B3 DNA binding domain
PfamPF023622.5E-14189262IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.104.9E-24326430IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019362.94E-22326430IPR015300DNA-binding pseudobarrel domain
CDDcd100171.81E-19331429No hitNo description
PROSITE profilePS5086313.394332432IPR003340B3 DNA binding domain
SMARTSM010192.3E-13333432IPR003340B3 DNA binding domain
PfamPF023621.1E-16333430IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.102.4E-16456553IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.47E-13459550IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086311.956460556IPR003340B3 DNA binding domain
CDDcd100173.29E-14460554No hitNo description
SMARTSM010197.4E-10461556IPR003340B3 DNA binding domain
PfamPF023621.9E-10469551IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 557 aa     Download sequence    Send to blast
MTSHHGRVDG YSPEPYHFFK VILESTIRDG KLGIPHKFAK DYGSKLSSPV FFQVPNGEVW  60
ELELMKLDGK LWVQNGWRKF TEHYSLELGH FLVFRYQGNC RFHVLIFDKS ASEIHYPYAN  120
NISTDMTKKP VLQCPRPPKI MRASNNGFSA TKGNEKSKAL ERASSAFKSG NPFFLVFMQP  180
TYVGLHSKVS CLSIPKEFAR KYLMDHGDVI LCDSSGNTWS AQYRATLGMN GQPYVKLLNG  240
WDAFVRDNNL QVGDVCAFEL TNCIDISFQV FIYRSKKADF HGSPAQMEAG ITLTRSECLR  300
PVKARETAFH REMKVREKAL QRALAFTSEN PFFVVVLRPS YVQSHALCIS NDFTRKHFKT  360
TLTNVGIALR LSNGKSWPAE YHQRSIGNPN ARICNGWRAF VNDNKLKVGD VCVFELVSDA  420
QISFKVIIFQ AIADEDTHPL QGVSEEALGL TSQSAKAPLI FRRVVLPLHL KEDCVDIPFR  480
FVEQYFEPNV QKLILQVADR TWPVEITSNP RIRIAKLTSG WIKFARENSL REGDICVYEL  540
DTVNNNLLKV SISKYAS
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017635635.10.0PREDICTED: B3 domain-containing protein LOC_Os12g40080-like
TrEMBLA0A2P5SLU30.0A0A2P5SLU3_GOSBA; Uncharacterized protein
STRINGGorai.007G209200.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM1834349
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18990.11e-25B3 family protein