PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.007G209200.1
Common Name B456_007G209200
Organism Gossypium raimondii
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family B3
Protein Properties Length: 632 aa    MW: 71855.4 Da    pI: 9.2673
Description B3 family protein
Gene Model
Gene Model ID       Type    Source  Coding Sequence
Gorai.007G209200.1  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    B3      50.7   3.3e-16  93     184  2          100
                         EEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE.. CS
                  B3   2 fkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefel 92 
                         fkv   +++ + g l +p kfa+++g+k   s  + +e ++g++We++l   k +g+ + ++GW++F +   L+ g f+vF+++++ +f  
  Gorai.007G209200.1  93 FKVI-LESTIRDGKLGIPHKFAKDYGSK--LSSPVFFEVPNGEVWELEL--MKLDGKLWVQNGWRKFTEHYSLELGHFLVFRYQRNCRF-- 176
                         5555.356677788***********988..5557***************..********************************999999.. PP

                         EEEEE-SS CS
                  B3  93 vvkvfrks 100
                         +v +f++s
  Gorai.007G209200.1 177 HVLIFDRS 184
                         *****985 PP

2    B3      49.9   5.8e-16  260    336  13         86
                         T...T-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE....EEETTE.....EEE-TTHHHHHHHHT--TT-EEEEEE-S CS
                  B3  13 s...grlvlpkkfaeehggkkeesktltledesgrsWevkliy..rkksgr.....yvltkGWkeFvkangLkegDfvvFkldg 86 
                         +   ++l++pk+f +++ ++  ++ +++l d+sg++W+ +++        +     + l +GW  Fv++n+L++gD++ F+l++
  Gorai.007G209200.1 260 HskvSCLSIPKEFSRKYLMD--QG-DVILCDSSGNTWSAQYRAtlG----MngqpyVKLLNGWDAFVRDNNLQVGDVCAFELID 336
                         4445669**********656..22.7***************66541....22245577889********************875 PP

3    B3      58.3   1.3e-18  407    503  1          97
                         EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTE...EEE-TTHHHHHHHHT--TT-EEEEEE-SSS CS
                  B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgr...yvltkGWkeFvkangLkegDfvvFkldgrs 88 
                         f+ vl ps+v +++ l+++++f+++h + + +++ + l+ ++g+sW  ++    +s +     + +GW+ Fv++n+Lk+gD++vF+l++++
  Gorai.007G209200.1 407 FVVVLRPSYVQSHA-LCISNDFTRKHFKTTLTNIGIALRLSNGKSWPAEYHQ--RSIGnpnARICNGWRAFVNDNKLKVGDVCVFELVSDT 494
                         67788888888887.***********888899999***************33..333334677889*********************9988 PP

                         EE..EEEEE CS
                  B3  89 efelvvkvf 97 
                         + + +v +f
  Gorai.007G209200.1 495 QISVKVIIF 503
                         877777666 PP

4    B3      40.2   6.2e-13  543    625  9          95
                         HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEE CS
                  B3   9 dvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvk 95 
                         +  k++++ +p +f+e++ +++ ++  l+l +   r+W v+++ + + +   lt+GW +F+++n L+egD++v +ld  ++  l+v+
  Gorai.007G209200.1 543 HL-KEDCVDIPFRFVEQYFEPNVQK--LIL-QVADRTWPVEITSNPRIRIAKLTSGWIKFARENSLREGDICVYELDTVDNNLLKVS 625
                         44.45559**********7675564..554.4477*******77777777*************************987666556665 PP

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End  InterPro ID  Description
SuperFamily      SSF101936          3.14E-24  85     184  IPR015300    DNA-binding pseudobarrel domain
Gene3D           G3DSA:2.40.330.10  1.8E-23   86     184  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            1.01E-18  91     182  No hit       No description
PROSITE profile  PS50863            13.676    91     184  IPR003340    B3 DNA binding domain
Pfam             PF02362            2.4E-13   92     183  IPR003340    B3 DNA binding domain
SMART            SM01019            3.4E-18   92     184  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  9.2E-20   240    348  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          1.59E-20  243    348  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            3.93E-19  246    348  No hit       No description
SMART            SM01019            1.0E-9    248    350  IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            13.437    251    350  IPR003340    B3 DNA binding domain
Pfam             PF02362            6.0E-14   263    336  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  1.4E-23   400    501  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          5.49E-22  400    500  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            8.66E-18  405    501  No hit       No description
PROSITE profile  PS50863            12.971    406    506  IPR003340    B3 DNA binding domain
Pfam             PF02362            1.1E-16   407    504  IPR003340    B3 DNA binding domain
SMART            SM01019            5.6E-14   407    506  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  2.8E-16   530    627  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          1.88E-13  533    624  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            3.02E-14  534    628  No hit       No description
PROSITE profile  PS50863            11.984    534    630  IPR003340    B3 DNA binding domain
SMART            SM01019            6.3E-10   535    630  IPR003340    B3 DNA binding domain
Pfam             PF02362            2.2E-10   543    625  IPR003340    B3 DNA binding domain
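
The hits in the table above come from six databases but cluster around the same stretches of the protein. A minimal sketch of how the Start/End values could be collapsed into non-overlapping regions is shown here; the names FEATURE_HITS and merge_regions are illustrative only and are not part of PlantTFDB.

```python
# (database, start, end) copied from the Protein Features table above.
# FEATURE_HITS and merge_regions are hypothetical names used for illustration.
FEATURE_HITS = [
    ("SuperFamily", 85, 184), ("Gene3D", 86, 184), ("CDD", 91, 182),
    ("PROSITE profile", 91, 184), ("Pfam", 92, 183), ("SMART", 92, 184),
    ("Gene3D", 240, 348), ("SuperFamily", 243, 348), ("CDD", 246, 348),
    ("SMART", 248, 350), ("PROSITE profile", 251, 350), ("Pfam", 263, 336),
    ("Gene3D", 400, 501), ("SuperFamily", 400, 500), ("CDD", 405, 501),
    ("PROSITE profile", 406, 506), ("Pfam", 407, 504), ("SMART", 407, 506),
    ("Gene3D", 530, 627), ("SuperFamily", 533, 624), ("CDD", 534, 628),
    ("PROSITE profile", 534, 630), ("SMART", 535, 630), ("Pfam", 543, 625),
]


def merge_regions(hits):
    """Merge overlapping (start, end) spans into non-overlapping regions."""
    regions = []
    for _, start, end in sorted(hits, key=lambda h: (h[1], h[2])):
        if regions and start <= regions[-1][1]:
            # Overlaps the current region: extend its end if needed.
            regions[-1][1] = max(regions[-1][1], end)
        else:
            # No overlap: open a new region.
            regions.append([start, end])
    return [tuple(r) for r in regions]


regions = merge_regions(FEATURE_HITS)
for start, end in regions:
    print(start, end)
```

With the coordinates listed above, the merge yields four regions, one around each of the four B3 signature domains.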
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 632 aa
HLLKRTKFSS TSLGYNWLTV IITHHQDLKI SRNNGVLVGN FEKKNYEYIN HLFVIAVLHK  60
FSKNLISELP SHSTMAYHHG RVDGYSPEPY HFFKVILEST IRDGKLGIPH KFAKDYGSKL  120
SSPVFFEVPN GEVWELELMK LDGKLWVQNG WRKFTEHYSL ELGHFLVFRY QRNCRFHVLI  180
FDRSASEIHY PYANNISTDM TKKPVLQCPR PPKIMRASNN GFSATKGNEK SKALERASSA  240
IKSGNPFFLV FMQPTYVGLH SKVSCLSIPK EFSRKYLMDQ GDVILCDSSG NTWSAQYRAT  300
LGMNGQPYVK LLNGWDAFVR DNNLQVGDVC AFELIDCIDI SFQVVIYSSK KADFHGSPAQ  360
MEAGMTLTRS ECLEPVKARE TAFHREMKVR EKALQRALAF TSENPFFVVV LRPSYVQSHA  420
LCISNDFTRK HFKTTLTNIG IALRLSNGKS WPAEYHQRSI GNPNARICNG WRAFVNDNKL  480
KVGDVCVFEL VSDTQISVKV IIFQAIADED SHPSQGASEE ALGLTSQSAK APLIFRRVVL  540
PLHLKEDCVD IPFRFVEQYF EPNVQKLILQ VADRTWPVEI TSNPRIRIAK LTSGWIKFAR  600
ENSLREGDIC VYELDTVDNN LLKVSISKYA S*
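
The sequence block above mixes residues with spacing and running position counters, while the Signature Domain table gives 1-based Start/End coordinates. The following is a minimal sketch, assuming those coordinates are inclusive (which matches the alignment blocks), of how the block could be cleaned and the four B3 domains sliced out. The names SEQUENCE_BLOCK, B3_DOMAINS, parse_sequence_block and domain_slice are illustrative and not part of any PlantTFDB tooling.

```python
import re

# The sequence block above, copied verbatim (residue groups plus running counters).
SEQUENCE_BLOCK = """
HLLKRTKFSS TSLGYNWLTV IITHHQDLKI SRNNGVLVGN FEKKNYEYIN HLFVIAVLHK  60
FSKNLISELP SHSTMAYHHG RVDGYSPEPY HFFKVILEST IRDGKLGIPH KFAKDYGSKL  120
SSPVFFEVPN GEVWELELMK LDGKLWVQNG WRKFTEHYSL ELGHFLVFRY QRNCRFHVLI  180
FDRSASEIHY PYANNISTDM TKKPVLQCPR PPKIMRASNN GFSATKGNEK SKALERASSA  240
IKSGNPFFLV FMQPTYVGLH SKVSCLSIPK EFSRKYLMDQ GDVILCDSSG NTWSAQYRAT  300
LGMNGQPYVK LLNGWDAFVR DNNLQVGDVC AFELIDCIDI SFQVVIYSSK KADFHGSPAQ  360
MEAGMTLTRS ECLEPVKARE TAFHREMKVR EKALQRALAF TSENPFFVVV LRPSYVQSHA  420
LCISNDFTRK HFKTTLTNIG IALRLSNGKS WPAEYHQRSI GNPNARICNG WRAFVNDNKL  480
KVGDVCVFEL VSDTQISVKV IIFQAIADED SHPSQGASEE ALGLTSQSAK APLIFRRVVL  540
PLHLKEDCVD IPFRFVEQYF EPNVQKLILQ VADRTWPVEI TSNPRIRIAK LTSGWIKFAR  600
ENSLREGDIC VYELDTVDNN LLKVSISKYA S*
"""

# Signature Domain coordinates from this record: (start, end), 1-based inclusive.
B3_DOMAINS = [(93, 184), (260, 336), (407, 503), (543, 625)]


def parse_sequence_block(text):
    """Drop whitespace and position counters, keeping residues and the trailing '*'."""
    return re.sub(r"[\s\d]", "", text)


def domain_slice(seq, start, end):
    """Return residues start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = parse_sequence_block(SEQUENCE_BLOCK)
for start, end in B3_DOMAINS:
    fragment = domain_slice(seq, start, end)
    print(start, end, len(fragment), fragment[:8])
```

Parsed this way, the string has 632 characters including the trailing `*`, so the listed length of 632 aa appears to count the terminator along with the 631 residues.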
Regulation -- PlantRegMap
Source Upstream Regulator Target Gene
PlantRegMap Retrieve -
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_012491737.1      0.0      PREDICTED: B3 domain-containing protein LOC_Os12g40080-like isoform X1
Refseq  XP_012491738.1      0.0      PREDICTED: B3 domain-containing protein LOC_Os12g40080-like isoform X1
TrEMBL  A0A0D2TIU7          0.0      A0A0D2TIU7_GOSRA; Uncharacterized protein (Fragment)
STRING  Gorai.007G209200.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage Orthologous Group ID Taxa Number Gene Number
MalvidsOGEM1834349
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G18990.1  2e-24    B3 family protein
Publications
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]