PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_34094_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family B3
Protein Properties Length: 719aa    MW: 81146.9 Da    PI: 7.7497
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_34094_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B349.86.4e-16171001099
                                 HHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE.. CS
                          B3  10 vlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefel 92 
                                 + + + l +p kf++++g+    s  + l+ +sg +W v+l  +k++g ++l++GW+eF +   Lk g  +vFk++g+ +f  
  Cotton_A_34094_BGI-A2_v1.0  17 NIRDKKLEIPGKFVRKYGNG--LSNSVLLTVPSGDTWHVEL--TKSDGIVWLQNGWQEFSEYFSLKYGHLLVFKYEGNGKF-- 93 
                                 3445669**********855..7788***************..********************************998777.. PP

                                 EEEEE-S CS
                          B3  93 vvkvfrk 99 
                                 +v +f++
  Cotton_A_34094_BGI-A2_v1.0  94 LVLIFDT 100
                                 ****997 PP

2B352.31e-16332427195
                                 EEEE-..-HHHHTT-EE--HHH.HTT.---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT--TT-E CS
                          B3   1 ffkvltpsdvlksgrlvlpkkfaeeh.ggkkeesktltledesgrsWevkl.iy..rkksgryvltkGWkeFvkangLkegDf 79 
                                 f+ ++ ps+ +  + +++pk+f+ ++         +ltl +++gr+W+ ++ +y  r+k  + ++  GW++F  +n+L++gD+
  Cotton_A_34094_BGI-A2_v1.0 332 FLVTMQPSYINPGRKMCIPKNFTMKFlT-R--DLGDLTLCTSDGRTWSAQYlRYmtRNKYTKATIHIGWRQFMLDNNLEAGDV 411
                                 667788888888899********88842.2..2338***************54488777778999****************** PP

                                 EEEEE-SSSEE..EEE CS
                          B3  80 vvFkldgrsefelvvk 95 
                                 +vF+l++++e  l+v 
  Cotton_A_34094_BGI-A2_v1.0 412 CVFELISQTEIMLKVI 427
                                 *****98666655555 PP

3B359.46.1e-19494593299
                                 EEE-...-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT--TT-EE CS
                          B3   2 fkvlt.psdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy...rkksgryvltkGWkeFvkangLkegDfv 80 
                                 fkv + p + + ++ l++p kf++++  +  e+ + +l+ ++gr+W vk+ +   ++++ +  ++  W+ F+++n+L++gD++
  Cotton_A_34094_BGI-A2_v1.0 494 FKVVMqPRYLTIRCSLSIPYKFVNQYLDE--EKEEAILRVSDGRTWVVKFAVkvvTGGQHKAEFSHTWRAFARDNNLEVGDVC 574
                                 5555505555566**********999433..23479**************77766666666889999**************** PP

                                 EEEE-SSSEE..EEEEE-S CS
                          B3  81 vFkldgrsefelvvkvfrk 99 
                                 vF+l++r+e +++v++f++
  Cotton_A_34094_BGI-A2_v1.0 575 VFELINRNENSFKVSIFSA 593
                                 *****9988889***9985 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:2.40.330.101.3E-242101IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019362.35E-242101IPR015300DNA-binding pseudobarrel domain
CDDcd100171.19E-19799No hitNo description
PROSITE profilePS5086314.1568101IPR003340B3 DNA binding domain
SMARTSM010193.0E-159101IPR003340B3 DNA binding domain
PfamPF023623.2E-1317100IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.102.6E-21324429IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019365.69E-21325430IPR015300DNA-binding pseudobarrel domain
CDDcd100173.67E-20330430No hitNo description
PfamPF023623.4E-15332430IPR003340B3 DNA binding domain
SMARTSM010191.5E-11332432IPR003340B3 DNA binding domain
PROSITE profilePS5086313.662332432IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.102.3E-23485592IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019368.24E-22487592IPR015300DNA-binding pseudobarrel domain
CDDcd100173.10E-23492592No hitNo description
PfamPF023626.4E-17493593IPR003340B3 DNA binding domain
SMARTSM010197.0E-13494594IPR003340B3 DNA binding domain
PROSITE profilePS5086314.536494594IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 719 aa     Download sequence    Send to blast
MFTPKTPHFF KIILDANIRD KKLEIPGKFV RKYGNGLSNS VLLTVPSGDT WHVELTKSDG  60
IVWLQNGWQE FSEYFSLKYG HLLVFKYEGN GKFLVLIFDT SASEIEYPCK SHIEDQKSDD  120
QMCLKLVKEE PKDDVCDKPL YGTPSCKETK RKKSRPLCSN TEYLCKSYIE DPKSDDQLCL  180
KPVKEEAKDG VYDETLYETP PCKKARRNKS QSQCSKVEYP CKNHIEDNKS EDQVSLKPVK  240
EEAKDDVCDK TLYETPPCKE TRKKKPQSQC SKPRKKLKTA QKDKSKKDRE DASTDEGGLK  300
TKVPRDGRAF GAIEYDKALQ RAFSFESENP FFLVTMQPSY INPGRKMCIP KNFTMKFLTR  360
DLGDLTLCTS DGRTWSAQYL RYMTRNKYTK ATIHIGWRQF MLDNNLEAGD VCVFELISQT  420
EIMLKVIIYR VHQDASCSSP LGGINSLEIG DNVSSSIPGS TESKHHCSIR PLTPHEKARA  480
IQKASNFKSK NPFFKVVMQP RYLTIRCSLS IPYKFVNQYL DEEKEEAILR VSDGRTWVVK  540
FAVKVVTGGQ HKAEFSHTWR AFARDNNLEV GDVCVFELIN RNENSFKVSI FSAAPGANSS  600
VSPQAHDVKA SQVASKNCSV PKIEADDEFG NCYAGNPGPA AQPTTIGLQD NEEEVNPTDD  660
AIASQVASKD FLVPKVEVDD DFGKCHAGNF SPAAQITTTG YQEAEEEVKP TISTRPRDP
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4i1k_A2e-1830859125144B3 domain-containing transcription factor VRN1
4i1k_B2e-1830859125144B3 domain-containing transcription factor VRN1
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1272278PRKKLKT
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJX6167621e-65JX616762.1 Gossypium hirsutum clone NBRI_GE61640 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017638159.10.0PREDICTED: uncharacterized protein LOC108479849
TrEMBLA0A1U8LBD90.0A0A1U8LBD9_GOSHI; uncharacterized protein LOC107925711 isoform X1
STRINGGorai.008G071200.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM16925512
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18990.15e-29B3 family protein