PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.008G071300.1
Common NameB456_008G071300, LOC105763318
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family B3
Protein Properties Length: 733aa    MW: 82692.7 Da    PI: 8.7262
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.008G071300.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B352.96.9e-1727110998
                         HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE- CS
                  B3   9 dvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfr 98 
                          + + + l +p+kf++++g+    s  + l+ +sg +W  +l  +k++g +++++GW+ F++   L+ g f+vF+++g+ +f  +v +f+
  Gorai.008G071300.1  27 ATIRDKKLGIPRKFVRKYGK--GLSSSVLLTVPSGDTWHAQL--TKSDGVVWFQNGWQAFAEYYSLQYGHFLVFRYEGNGKF--LVLIFD 110
                         34455669**********84..47778***************..********************************998777..999997 PP

2B355.51e-17235333198
                         EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSS CS
                  B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkl.iy..rkksgryvltkGWkeFvkangLkegDfvvFkldgrs 88 
                         f+ ++ ps+ +  + +++pk+f+ ++  k++ + +ltl +++g++W+ ++ +y  r+k  + ++  GW++F  +n+L++gD++vF+l++++
  Gorai.008G071300.1 235 FLVIMQPSYINPGRKMCIPKEFTMKFL-KENLG-DLTLCTSEGKTWSTQYwRYisRNKYTKAIIHIGWRQFMLDNNLEAGDVCVFELISQT 323
                         667788888888899********8884.44454.9***************433777777779999***********************987 PP

                         EE..EEEEE- CS
                  B3  89 efelvvkvfr 98 
                         e  l+v ++r
  Gorai.008G071300.1 324 ESMLKVIIYR 333
                         7767777766 PP

3B356.45.5e-18397495199
                         EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSS CS
                  B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy...rkksgryvltkGWkeFvkangLkegDfvvFkldgrs 88 
                         f+ v+ p +   ++ l +p kf+++h  +  e+ + +l+ ++gr+W vk+++   ++++ +    + W+ F+++n+L++gD++vF+l++r+
  Gorai.008G071300.1 397 FKVVMQPRYLILRCSLGIPYKFVNRHLDE--EKEEAILRVSDGRTWVVKFTVkvfTGGQHKA-EFSTWRAFARDNNLEVGDVCVFELINRH 484
                         4455667777788*************433..23479**************776543333333.3369*********************998 PP

                         EE..EEEEE-S CS
                  B3  89 efelvvkvfrk 99 
                         e +++v++f++
  Gorai.008G071300.1 485 ENSFKVSIFSA 495
                         8889***9985 PP

4B351.91.4e-16626717191
                         EEEE-..-HHHHTT-EE--HHH.HTT..---..--SEEEEEETTS-EEEEEE....EEETTE..EEE-TTHHHHHHHHT--TT-EEEEEE- CS
                  B3   1 ffkvltpsdvlksgrlvlpkkfaeeh..ggkkeesktltledesgrsWevkliy..rkksgr..yvltkGWkeFvkangLkegDfvvFkld 85 
                         f  ++ ps+v++  rl +p +f +++  +    ++  +  +  +g++W  k+    +++       l +GWk F+k+n+L+ gD++vF+++
  Gorai.008G071300.1 626 FTVAMQPSYVSNGYRLAIPLDFSRKYlrN---GSGNAILSMVGDGKTWLTKYHReaKGT--NprAKLIDGWKTFAKDNNLEIGDVCVFEMI 711
                         677899********************642...34445666678*********5544333..24588999********************98 PP

                         SSSEE. CS
                  B3  86 grsefe 91 
                         ++++ +
  Gorai.008G071300.1 712 NSEGYQ 717
                         655443 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:2.40.330.104.3E-2612112IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.77E-2414112IPR015300DNA-binding pseudobarrel domain
CDDcd100173.30E-2018110No hitNo description
PROSITE profilePS5086313.91619112IPR003340B3 DNA binding domain
SMARTSM010191.2E-1720112IPR003340B3 DNA binding domain
PfamPF023629.0E-1428110IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.102.5E-22226333IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.28E-20228333IPR015300DNA-binding pseudobarrel domain
CDDcd100172.09E-22233333No hitNo description
PROSITE profilePS5086314.254235335IPR003340B3 DNA binding domain
PfamPF023623.2E-16235333IPR003340B3 DNA binding domain
SMARTSM010193.7E-11235335IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.103.7E-21388494IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019365.89E-21390494IPR015300DNA-binding pseudobarrel domain
CDDcd100176.21E-23395494No hitNo description
PROSITE profilePS5086313.888397496IPR003340B3 DNA binding domain
PfamPF023621.0E-15397495IPR003340B3 DNA binding domain
SMARTSM010191.8E-14397496IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.9E-21619727IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019362.16E-21620716IPR015300DNA-binding pseudobarrel domain
CDDcd100171.85E-21624726No hitNo description
PfamPF023624.2E-15626715IPR003340B3 DNA binding domain
PROSITE profilePS5086314.579626728IPR003340B3 DNA binding domain
SMARTSM010193.3E-14626728IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 733 aa     Download sequence    Send to blast
MPHSLKANNG SMFASKTPHF FKIILEATIR DKKLGIPRKF VRKYGKGLSS SVLLTVPSGD  60
TWHAQLTKSD GVVWFQNGWQ AFAEYYSLQY GHFLVFRYEG NGKFLVLIFD MSASEIEYPC  120
KSHIEDHNSD DQVCLKLVNK EAKDDTCDGT LYETPPCKET RKKKKKKRSR PPCSKPRKKL  180
KTTQKDKNEK DWEDESTSED DMQTKVPRDE HAFGVIEYDK ALQRASSFRS ENPFFLVIMQ  240
PSYINPGRKM CIPKEFTMKF LKENLGDLTL CTSEGKTWST QYWRYISRNK YTKAIIHIGW  300
RQFMLDNNLE AGDVCVFELI SQTESMLKVI IYRVRQDTSC SSPLGGINSS ENGGNVNSST  360
LGSTESNHDC LMRPMTPVEK ARAILKASNF KSKNPFFKVV MQPRYLILRC SLGIPYKFVN  420
RHLDEEKEEA ILRVSDGRTW VVKFTVKVFT GGQHKAEFST WRAFARDNNL EVGDVCVFEL  480
INRHENSFKV SIFSAAPGAN SSLSPQADDA EASQVASKNC LVPRIEADDD FGNCYAGNST  540
DDAIASQVAS KDWLVPKIEA DDDFGKCHVG NSSSAAQFPA IGYQETEEEV QPTISTRPRG  600
PQRLQAGEKA KALQRASGFK SQNPFFTVAM QPSYVSNGYR LAIPLDFSRK YLRNGSGNAI  660
LSMVGDGKTW LTKYHREAKG TNPRAKLIDG WKTFAKDNNL EIGDVCVFEM INSEGYQLSL  720
NVAIYKPQED QT*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4i1k_A5e-1721772025144B3 domain-containing transcription factor VRN1
4i1k_B5e-1721772025144B3 domain-containing transcription factor VRN1
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1161167KKKKKKR
2163178KKKKRSRPPCSKPRKK
3164179KKKRSRPPCSKPRKKL
4165178KKRSRPPCSKPRKK
5175181PRKKLKT
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJX6167621e-147JX616762.1 Gossypium hirsutum clone NBRI_GE61640 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012436969.10.0PREDICTED: B3 domain-containing protein Os03g0620400-like
TrEMBLA0A0D2SWH20.0A0A0D2SWH2_GOSRA; Uncharacterized protein
STRINGGorai.008G071300.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM16925512
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18990.11e-48B3 family protein
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]