PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_D12G0646
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family B3
Protein Properties Length: 926aa    MW: 104039 Da    PI: 9.4845
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_D12G0646genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B352.31e-16200283998
                  HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE- CS
           B3   9 dvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfr 98 
                   + + + l +p+kf++++g+    s  + l+ +sg +W  +l  +k++g +++++GW+ F++   L+ g f+vF+++g+ +f  +v +f+
  Gh_D12G0646 200 ATIRDKKLGIPRKFVKKYGK--GLSSSVLLTVPSGDTWHAQL--TKSDGVVWFQNGWQAFAEYYSLQYGHFLVFRYEGNGKF--LVLIFD 283
                  34455669**********84..47778***************..********************************998777..999997 PP

2B354.81.7e-17409507198
                  EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEE CS
           B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkl.iy..rkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvk 95 
                  f+ ++ ps+ +  + +++pk+f+ ++  k++ + +ltl +++g++W+ ++ +y  r+k  + ++  GW++F  an+L++gD++vF+l++++e  l+v 
  Gh_D12G0646 409 FLVIMQPSYINPGRKMCIPKEFTMKFL-KENLG-DLTLCTSEGKTWSTQYwRYisRNKYTKAIIHIGWRQFMLANNLEAGDVCVFELISQTESMLKVI 504
                  666788888888899********8884.44454.9***************433777777779999***********************9877767777 PP

                  EE- CS
           B3  96 vfr 98 
                  ++r
  Gh_D12G0646 505 IYR 507
                  766 PP

3B356.84.2e-18571669199
                  EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEE CS
           B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy...rkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvk 95 
                  f+ v+ p +   ++ l +p kf+++h  ++  + + +l+ ++gr+W vk+++   ++++ +    + W+ F+++n+L++gD++vF+l++r+e +++v+
  Gh_D12G0646 571 FKVVMQPRYLILRCSLGIPYKFVKRHLDEE--KEEAILRVSDGRTWVVKFTVkvfTGGQHKA-EFSTWRAFARDNNLEVGDVCVFELINRHENSFKVS 665
                  4455667777788*************4332..3479**************776543333333.3369*********************9988889*** PP

                  EE-S CS
           B3  96 vfrk 99 
                  +f++
  Gh_D12G0646 666 IFSA 669
                  9985 PP

4B351.71.6e-16820911191
                  EEEE-..-HHHHTT-EE--HHH.HTT..---..--SEEEEEETTS-EEEEEE....EEETTE..EEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE. CS
           B3   1 ffkvltpsdvlksgrlvlpkkfaeeh..ggkkeesktltledesgrsWevkliy..rkksgr..yvltkGWkeFvkangLkegDfvvFkldgrsefe 91 
                  f+ ++ ps+v++  rl +p +f +++  +    ++  +  +  +g++W  k+    +++       l +GWk F+k+n+L+ gD++vF+++++++ +
  Gh_D12G0646 820 FIVAMQPSYVSNGYRLAIPLDFSRKYlrN---GSGNAILSMVGDGKTWLTKYHReaKGT--NprAKLIDGWKTFAKDNNLEIGDVCVFEMINSEGYQ 911
                  788999********************642...34445666678*********5544333..24588999********************98655443 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:2.40.330.105.6E-26185285IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019362.55E-24187285IPR015300DNA-binding pseudobarrel domain
CDDcd100172.01E-20191283No hitNo description
PROSITE profilePS5086313.86192285IPR003340B3 DNA binding domain
SMARTSM010192.2E-17193285IPR003340B3 DNA binding domain
PfamPF023621.5E-13201283IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.106.5E-22400507IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019363.34E-20402507IPR015300DNA-binding pseudobarrel domain
CDDcd100171.72E-22407507No hitNo description
SMARTSM010193.8E-11409509IPR003340B3 DNA binding domain
PfamPF023624.9E-16409507IPR003340B3 DNA binding domain
PROSITE profilePS5086314.184409509IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.102.6E-21562668IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019364.32E-21564668IPR015300DNA-binding pseudobarrel domain
CDDcd100174.65E-24569668No hitNo description
SMARTSM010193.7E-15571670IPR003340B3 DNA binding domain
PROSITE profilePS5086313.902571670IPR003340B3 DNA binding domain
PfamPF023627.5E-16571669IPR003340B3 DNA binding domain
SuperFamilySSF1019364.9E-21813908IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.103.1E-21813921IPR015300DNA-binding pseudobarrel domain
CDDcd100171.06E-21818920No hitNo description
SMARTSM010195.2E-14820922IPR003340B3 DNA binding domain
PROSITE profilePS5086314.579820922IPR003340B3 DNA binding domain
PfamPF023625.7E-15820909IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 926 aa     Download sequence    Send to blast
MKVQTYPVVK NASKACGTQG DIVSRVKTRS LVSDAKRSCQ QSARLPSSRE FIDLTDSYIE  60
SLDDSPLDQK TKKKMTSPSF QPCKLRTNPS ESNQAKGIKL EKEKKSLNFQ YLTKEVGGEL  120
KYSVKNDSGR RSSARRRPKP DPVYGKQRAC VGASAFRTSN PSFSVVIQPS YIGSSSALKA  180
NNGSMFASKT PHFFKIILEA TIRDKKLGIP RKFVKKYGKG LSSSVLLTVP SGDTWHAQLT  240
KSDGVVWFQN GWQAFAEYYS LQYGHFLVFR YEGNGKFLVL IFDMSASEIE YPCKSHIEDH  300
NSDDQVCLKL VKKEAKDDTC DGTLYETPPC KETRKKKKKK KRSRPPCSKP PKKLKTTQKD  360
KNEKDWEDES TSEDDMQTKV PRDEHAFGVI EYDKALQRAS SFRSENPFFL VIMQPSYINP  420
GRKMCIPKEF TMKFLKENLG DLTLCTSEGK TWSTQYWRYI SRNKYTKAII HIGWRQFMLA  480
NNLEAGDVCV FELISQTESM LKVIIYRVRQ DTSCSSPLGG INSSENGGNV NSSTLGSTKS  540
NHDCLMRPMT PVEKARAILK ASNFKSKNPF FKVVMQPRYL ILRCSLGIPY KFVKRHLDEE  600
KEEAILRVSD GRTWVVKFTV KVFTGGQHKA EFSTWRAFAR DNNLEVGDVC VFELINRHEN  660
SFKVSIFSAA PGANSSLSPQ ADDAEASQVA SKNCLVPRIE ADDDFGNCYA GNSSPAAQLT  720
TIEYQENEEE ANLTDDAIAS QVASKDWLVP KIEADDDFGK CHVGNSSSAA QFPAIGYQET  780
EEEVQPTIST RPRGPQRLQA GEKAKALQRA SGFKSQNPFF IVAMQPSYVS NGYRLAIPLD  840
FSRKYLRNGS GNAILSMVGD GKTWLTKYHR EAKGTNPRAK LIDGWKTFAK DNNLEIGDVC  900
VFEMINSEGY QLSLNVAIYK PQEDQT
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4i1k_A2e-1639191425144B3 domain-containing transcription factor VRN1
4i1k_B2e-1639191425144B3 domain-containing transcription factor VRN1
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1333341RKKKKKKKR
2333354RKKKKKKKRSRPPCSKPPKKLK
3335341KKKKKKR
4339352KKRSRPPCSKPPKK
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJX6167621e-155JX616762.1 Gossypium hirsutum clone NBRI_GE61640 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016735007.10.0PREDICTED: B3 domain-containing protein Os03g0620400-like
TrEMBLA0A1U8N7B60.0A0A1U8N7B6_GOSHI; B3 domain-containing protein Os03g0620400-like
STRINGGorai.008G071300.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM16925512
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18990.12e-29B3 family protein