PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.008G071200.1
Common Name B456_008G071200
Organism Gossypium raimondii
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family B3
Protein Properties Length: 733 aa    MW: 82543.2 Da    pI: 7.1328
Description B3 family protein
Gene Model
Gene Model ID       Type    Source  Coding Sequence
Gorai.008G071200.1  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    B3      49.7   6.6e-16  30     113  10         99
                         HHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
                  B3  10 vlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99 
                         + + + l +p kf++++g+    s  + l+ +sg +W v+l  +k++g ++l++GW+eF +   Lk g  +vFk++g+ +f  +v +f++
  Gorai.008G071200.1  30 NIRDKKLEIPGKFVRKYGNG--LSNSVLLTVPSGDTWHVEL--TKSDGIVWLQNGWQEFSEYFSLKYGHLLVFKYEGNGKF--LVLIFDT 113
                         3445669**********855..7788***************..********************************998777..****997 PP

2    B3      52.3   1e-16    345    442  1          97
                         EEEE-..-HHHHTT-EE--HHH.HTT.---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS CS
                  B3   1 ffkvltpsdvlksgrlvlpkkfaeeh.ggkkeesktltledesgrsWevkliy...rkksgryvltkGWkeFvkangLkegDfvvFkldgr 87 
                         f+ ++ ps+ +  + +++pk+f+ ++         +ltl +++gr+W+ ++     r+k  + ++  GW++F  +n+L++gD++vF+l+++
  Gorai.008G071200.1 345 FLVTMQPSYINPGRKMCIPKNFTMKFlT-R--DLGDLTLCTSDGRTWSAQYLCymtRNKYTKATIHIGWRQFMLDNNLEAGDVCVFELISQ 432
                         667788888888899********88842.2..2338***************4448877778899************************987 PP

                         SEE..EEEEE CS
                  B3  88 sefelvvkvf 97 
                         +e  l+v ++
  Gorai.008G071200.1 433 TEIMLKVIIY 442
                         6666666655 PP

3    B3      59     8.2e-19  507    605  2          98
                         EEE-...-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSS CS
                  B3   2 fkvlt.psdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy...rkksgryvltkGWkeFvkangLkegDfvvFkldgrs 88 
                         fkv + p + + ++ l++p kf++++  +  e+ + +l+ ++gr+W vk+ +   ++++ +  ++  W+ F+++n+L++gD++vF+l++r+
  Gorai.008G071200.1 507 FKVVMqPRYLTIRCSLSIPYKFVNQYLDE--EKEEAILQVSDGRTWVVKFAVkvvTGGQHKAEFSHTWRAFARDNNLEVGDVCVFELINRN 595
                         5555505555566**********999433..23479**************77766666666889999*********************998 PP

                         EE..EEEEE- CS
                  B3  89 efelvvkvfr 98 
                         e +++v++f+
  Gorai.008G071200.1 596 ENSFKVSIFS 605
                         8889***997 PP

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:2.40.330.10  5.8E-25   13     114  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          2.55E-24  15     114  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            8.27E-20  20     112  No hit       No description
PROSITE profile  PS50863            14.156    21     114  IPR003340    B3 DNA binding domain
SMART            SM01019            3.0E-15   22     114  IPR003340    B3 DNA binding domain
Pfam             PF02362            3.3E-13   30     113  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  7.0E-21   337    442  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          1.18E-20  338    443  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            8.19E-20  343    443  No hit       No description
SMART            SM01019            1.1E-10   345    445  IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            13.38     345    445  IPR003340    B3 DNA binding domain
Pfam             PF02362            3.1E-15   345    443  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  4.3E-23   498    605  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          1.14E-21  500    605  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            3.02E-23  505    605  No hit       No description
Pfam             PF02362            7.9E-17   506    606  IPR003340    B3 DNA binding domain
SMART            SM01019            2.9E-12   507    607  IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            14.522    507    607  IPR003340    B3 DNA binding domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 733 aa
MPKSLVRKEN NNSMFTPKTP HFFKIILDAN IRDKKLEIPG KFVRKYGNGL SNSVLLTVPS  60
GDTWHVELTK SDGIVWLQNG WQEFSEYFSL KYGHLLVFKY EGNGKFLVLI FDTSASEIEY  120
PCKSHIEDQE SDDQMCLKPV KEEPKDDVCD KPLHGTPSCK ETKRKKSRSL CSNAEYLCKS  180
YIDDHKSDDQ LCLKPVKEEA KDGVYDETLY ETPPCKKARR TKSQSQCSKI EYPCKSHIED  240
HKSEDQVSLK PVKEEAKDDV CDKTLYETPP CKETRKKKPQ SQCSKPRKKL KTAQKDKNRK  300
DCEDASTDEE GLQTKVPRDG RAFGAIEYDK ALQRAFSFES ENPFFLVTMQ PSYINPGRKM  360
CIPKNFTMKF LTRDLGDLTL CTSDGRTWSA QYLCYMTRNK YTKATIHIGW RQFMLDNNLE  420
AGDVCVFELI SQTEIMLKVI IYRVRQDASC SSPLGGINSL ENGDNVSSST PGSTESKHHC  480
SIRSLTPHEK ARAIQKASNF KSKNPFFKVV MQPRYLTIRC SLSIPYKFVN QYLDEEKEEA  540
ILQVSDGRTW VVKFAVKVVT GGQHKAEFSH TWRAFARDNN LEVGDVCVFE LINRNENSFK  600
VSIFSEAPGA NSSLSPQAHD VKASQVASKN CSVPKIEADD DFGNCYAGNP GPAAQPTTIG  660
HQENEEEVNP TDDAIASQVA SKDFLVPKVE VDDDFGKCYA GNFSPAAQIT TTGYQEAEEE  720
VKPTISTGPR GP*
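The domain coordinates in this record are 1-based and inclusive, while the sequence block carries spaces and trailing position counters. As a rough illustration, the Python sketch below strips that formatting and slices out the first B3 domain (residues 30 to 113). The helper names `parse_numbered_sequence` and `slice_1based` are hypothetical and not part of any PlantTFDB tool, and only the first two sequence lines (residues 1 to 120) are included to keep the example short.

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Join a position-numbered sequence block into one residue string.

    Everything that is not a letter is dropped: whitespace, the trailing
    position counters (e.g. "60"), and a terminal "*" stop marker.
    """
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"coordinates {start}-{end} outside 1-{len(seq)}")
    return seq[start - 1:end]


# First two lines of the protein sequence from this record (residues 1-120).
block = """
MPKSLVRKEN NNSMFTPKTP HFFKIILDAN IRDKKLEIPG KFVRKYGNGL SNSVLLTVPS  60
GDTWHVELTK SDGIVWLQNG WQEFSEYFSL KYGHLLVFKY EGNGKFLVLI FDTSASEIEY  120
"""

seq = parse_numbered_sequence(block)

# Signature domain no. 1 is reported at Start 30, End 113.
domain1 = slice_1based(seq, 30, 113)

print(len(seq), len(domain1))
print(domain1)
```

The slice should match the query row of the first B3 alignment once its gap characters (`-`) are removed, which gives a quick consistency check between the table coordinates and the sequence.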
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
4i1k_A  3e-18   321          604        25         144      B3 domain-containing transcription factor VRN1
4i1k_B  3e-18   321          604        25         144      B3 domain-containing transcription factor VRN1
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    285    291  PRKKLKT
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JX616762  2e-58    JX616762.1 Gossypium hirsutum clone NBRI_GE61640 microsatellite sequence.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_012436967.1      0.0      PREDICTED: uncharacterized protein LOC105763317 isoform X1
TrEMBL  A0A0D2T1X1          0.0      A0A0D2T1X1_GOSRA; Uncharacterized protein
STRING  Gorai.008G071200.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM16925512
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G18990.1  4e-29    B3 family protein
Publications
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]