PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_34095_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family B3
Protein Properties Length: 1516 aa    MW: 171151 Da    pI: 9.2964
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_34095_BGI-A2_v1.0 genome BGI View CDS
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 B3 49.1 1e-15 31 109 17 100
                                 E--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTE.EEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE- CS
                          B3  17 vlpkkfaeehggkkeesktltledesgrsWevkliyrkksgr.yvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfr 98 
                                  +p++f++++g++   s  ++le ++g  W+v+l  +k+ ++   l++GW+eF++   L+ g fvvF+++g+++f  +v +f+
  Cotton_A_34095_BGI-A2_v1.0  31 GIPRNFVRKYGNQ--LSSPVKLEVPNGAIWQVEL--TKSTDEkLHLQNGWREFAEHYLLEFGSFVVFRYEGNDRF--HVLIFD 107
                                 59********988..5567***************..666655599**************************9999..*****9 PP

                                 SS CS
                          B3  99 ks 100
                                 ks
  Cotton_A_34095_BGI-A2_v1.0 108 KS 109
                                 85 PP

2 B3 57.1 3.2e-18 176 271 1 96
                                 EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTE..EEE-TTHHHHHHHHT--TT-EEE CS
                          B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgr..yvltkGWkeFvkangLkegDfvv 81 
                                 f   + ps+v  + r+v+p +fa ++  k+++   ltl ++ g++W v++  + +s++    l +GW+ Fvk+n++++gD++v
  Cotton_A_34095_BGI-A2_v1.0 176 FMVLMRPSSVCYKYRMVIPLDFALKFLPKHNC--NLTLCNSAGKTWPVTFYRNTQSKKlsAQLYGGWQTFVKDNRINVGDICV 256
                                 6677899****************999555556..6***************33444444677888******************* PP

                                 EEE-SSSEE..EEEE CS
                          B3  82 Fkldgrsefelvvkv 96 
                                 F+l+++ e  ++v++
  Cotton_A_34095_BGI-A2_v1.0 257 FELIRQPEILMKVQI 271
                                 ****98666577766 PP

3 B3 62.4 7.1e-20 474 569 1 96
                                 EEEE-..-HHHHTT-EE--HHH.HTT..---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT--TT- CS
                          B3   1 ffkvltpsdvlksgrlvlpkkfaeeh..ggkkeesktltledesgrsWevkl.iy..rkksgryvltkGWkeFvkangLkegD 78 
                                 f  v+ ps+vl ++r+ +p +f++++  ++k  +   ltl +++g++W  k+ +y  +kk++++ l +GW eFv++n+L++gD
  Cotton_A_34095_BGI-A2_v1.0 474 FMVVMRPSYVLGKCRMFIPSNFTRKFltMYK--C--NLTLCNSTGKTWHAKFfRYpeNKKPNAH-LYGGWCEFVEDNHLNVGD 551
                                 77899*********************76554..5..59**************445977777777.677*************** PP

                                 EEEEEE-SSSEE..EEEE CS
                          B3  79 fvvFkldgrsefelvvkv 96 
                                 ++vF+l+++ e  ++v++
  Cotton_A_34095_BGI-A2_v1.0 552 ICVFELIKHPEILMKVQI 569
                                 ******998665566665 PP

4 B3 30.7 5.6e-10 696 768 3 75
                                 EE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT-- CS
                          B3   3 kvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy...rkksgryvltkGWkeFvkangLk 75 
                                  v  ps+ ++s+ l +p +f++++ +k   s +++l+  +gr W v+++    +k++++ + ++ W +F+k+n+L+
  Cotton_A_34095_BGI-A2_v1.0 696 VVIHPSYIGSSSSLHIPVEFVKRYLKK---SGEMVLRVVDGRIWIVEYRRkasNKGGKAKFGSRSWGQFAKDNQLE 768
                                 5677999*****************533...448***************877755555558888************8 PP

5 B3 51.5 1.9e-16 791 873 10 98
                                 HHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE.. CS
                          B3  10 vlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefel 92 
                                 + + + l +p+kf++++g+    s  + l+ +sg +W  +l  +k++g +++++GW+ F++   L+ g f+vF+++g+ +f  
  Cotton_A_34095_BGI-A2_v1.0 791 TIRDKKLGIPRKFVKKYGK--GLSSSVLLTVPSGDTWHAQL--TKSDGVVWFQNGWQAFAEYYSLQYGHFLVFRYEGNGKF-- 867
                                 4455669**********84..47778***************..********************************998777.. PP

                                 EEEEE- CS
                          B3  93 vvkvfr 98 
                                 +v +f+
  Cotton_A_34095_BGI-A2_v1.0 868 LVLIFD 873
                                 999997 PP

6 B3 53.9 3.3e-17 999 1097 1 98
                                  EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT--TT- CS
                          B3    1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkl.iy..rkksgryvltkGWkeFvkangLkegD 78  
                                  f+ ++ ps+ +  + +++pk+f+ ++  k++ + +ltl +++g++W+ ++ +y  r+k  + ++  GW++F  an+L++gD
  Cotton_A_34095_BGI-A2_v1.0  999 FLVIMQPSYINPGRKMCIPKEFTMKFL-KENLG-DLTLCTSEGKTWSTQYwRYisRNKYTKAIIHIGWRQFMLANNLEAGD 1077
                                  666788888888899********8884.44454.9***************433777777779999**************** PP

                                  EEEEEE-SSSEE..EEEEE- CS
                          B3   79 fvvFkldgrsefelvvkvfr 98  
                                  ++vF+l++++e  l+v ++r
  Cotton_A_34095_BGI-A2_v1.0 1078 VCVFELISQTESMLKVIIYR 1097
                                  *******9877767777766 PP

7 B3 54 3.1e-17 1161 1259 1 99
                                  EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT--TT- CS
                          B3    1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy...rkksgryvltkGWkeFvkangLkegD 78  
                                  f+ v+ p +   ++ l +p kf++++  ++  + + +l+ ++gr+W vk+ +   ++++ +    + W+ F+++n+L++gD
  Cotton_A_34095_BGI-A2_v1.0 1161 FKVVMQPRYLILRCSLGIPYKFVKRYLDEE--KEEAILQVSDGRTWVVKFAVkvfTGGQHKA-EFSTWRAFARDNNLEVGD 1238
                                  4455667777788*************4332..3479**************775543333333.3369************** PP

                                  EEEEEE-SSSEE..EEEEE-S CS
                          B3   79 fvvFkldgrsefelvvkvfrk 99  
                                  ++vF+l++r+e +++v++f++
  Cotton_A_34095_BGI-A2_v1.0 1239 VCVFELINRHENSFKVSIFSA 1259
                                  *******9988889***9985 PP

8 B3 50.5 3.8e-16 1410 1501 1 91
                                  EEEE-..-HHHHTT-EE--HHH.HTT..---..--SEEEEEETTS-EEEEEE....EEETTE..EEE-TTHHHHHHHHT-- CS
                          B3    1 ffkvltpsdvlksgrlvlpkkfaeeh..ggkkeesktltledesgrsWevkliy..rkksgr..yvltkGWkeFvkangLk 75  
                                  f  ++ ps+v++  rl +p +f +++  +    ++  +  +  +gr+W  k+    +++       l +GWk F+k+n+L+
  Cotton_A_34095_BGI-A2_v1.0 1410 FTVAMQPSYVSNGYRLAIPLDFSRKYlrN---GSGNAILSTVGDGRTWLTKYHReaKGT--NprAKLIDGWKTFAKDNNLE 1485
                                  677899********************542...334455556699********5544333..24588999************ PP

                                  TT-EEEEEE-SSSEE. CS
                          B3   76 egDfvvFkldgrsefe 91  
                                   gD++vF+++++++ +
  Cotton_A_34095_BGI-A2_v1.0 1486 IGDVCVFEMINSEGYQ 1501
                                  ********98655443 PP

Protein Features
Database         Entry ID           E-value   Start  End   InterPro ID  Description
SuperFamily      SSF101936          8.24E-20  23     109   IPR015300    DNA-binding pseudobarrel domain
SMART            SM01019            2.8E-12   24     109   IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  5.4E-19   29     109   IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            3.40E-15  32     107   No hit       No description
Pfam             PF02362            2.3E-12   32     108   IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            13.62     32     109   IPR003340    B3 DNA binding domain
SuperFamily      SSF101936          4.32E-22  170    272   IPR015300    DNA-binding pseudobarrel domain
Gene3D           G3DSA:2.40.330.10  7.0E-22   170    272   IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            4.84E-22  174    273   No hit       No description
Pfam             PF02362            5.6E-16   176    272   IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            14.24     176    275   IPR003340    B3 DNA binding domain
SMART            SM01019            3.7E-15   176    275   IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  5.5E-21   468    569   IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          4.32E-22  469    569   IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            6.90E-22  472    571   No hit       No description
Pfam             PF02362            7.4E-18   474    569   IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            13.535    474    573   IPR003340    B3 DNA binding domain
SMART            SM01019            7.5E-14   474    573   IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  4.1E-12   685    769   IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          6.87E-12  689    768   IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            8.94E-14  692    768   No hit       No description
PROSITE profile  PS50863            8.473     694    776   IPR003340    B3 DNA binding domain
SMART            SM01019            4.6E-6    694    780   IPR003340    B3 DNA binding domain
Pfam             PF02362            4.6E-8    696    769   IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  1.2E-25   775    875   IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          4.32E-24  777    875   IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            9.64E-20  781    873   No hit       No description
PROSITE profile  PS50863            13.86     782    875   IPR003340    B3 DNA binding domain
SMART            SM01019            2.2E-17   783    875   IPR003340    B3 DNA binding domain
Pfam             PF02362            2.7E-13   791    873   IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  1.2E-21   990    1097  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          6.28E-20  992    1097  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            8.77E-22  997    1097  No hit       No description
PROSITE profile  PS50863            14.184    999    1099  IPR003340    B3 DNA binding domain
Pfam             PF02362            9.2E-16   999    1097  IPR003340    B3 DNA binding domain
SMART            SM01019            3.8E-11   999    1099  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  1.2E-20   1152   1258  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          1.53E-20  1154   1258  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            1.31E-22  1159   1258  No hit       No description
Pfam             PF02362            2.2E-15   1161   1259  IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            13.831    1161   1260  IPR003340    B3 DNA binding domain
SMART            SM01019            5.1E-15   1161   1260  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  5.0E-21   1403   1511  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          1.75E-21  1404   1500  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            1.23E-20  1408   1510  No hit       No description
Pfam             PF02362            1.8E-14   1410   1499  IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            14.819    1410   1512  IPR003340    B3 DNA binding domain
SMART            SM01019            2.6E-14   1410   1512  IPR003340    B3 DNA binding domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1516 aa
MDFPVLVKRE TMEESVLLNS LPPFVCLFVY GIPRNFVRKY GNQLSSPVKL EVPNGAIWQV  60
ELTKSTDEKL HLQNGWREFA EHYLLEFGSF VVFRYEGNDR FHVLIFDKSA SEIDYPYTST  120
EDDDGDGDDH ISVGVIAARP SVKTEVLSCP QKLTDTEKAQ AVQIASAFKS TEHPVFMVLM  180
RPSSVCYKYR MVIPLDFALK FLPKHNCNLT LCNSAGKTWP VTFYRNTQSK KLSAQLYGGW  240
QTFVKDNRIN VGDICVFELI RQPEILMKVQ IYPAAKNASN ACRSQADNSI ASQLRTGSLV  300
SVTEPDCQQT RCPYSSSELK DSKLKTQKNI DFQNSTKELK GEFRCLAKND NGGVSGDWGC  360
LKPDLVSKMQ PLTPTEKQRA TDIASWSSLR LVSGKSKEKS HLPCSLPQKK MRINSPNQHG  420
QNSKLEVLSS GISSDAQRSV TTEALSCVQR LTAIEKTHAV QIASAFKSTE NPVFMVVMRP  480
SYVLGKCRMF IPSNFTRKFL TMYKCNLTLC NSTGKTWHAK FFRYPENKKP NAHLYGGWCE  540
FVEDNHLNVG DICVFELIKH PEILMKVQIY PVVKNASKAC GPQALGSIVS RVKTRSLVSD  600
AKRSCQQSAR PPSSREFIDL TDSYIESLDD SPLDQKTKKK MTSPSFQPCE LKYSVRNDRE  660
RRSSARRCPK PDPVYGKQRA CVGASAFRTS NPSFSVVIHP SYIGSSSSLH IPVEFVKRYL  720
KKSGEMVLRV VDGRIWIVEY RRKASNKGGK AKFGSRSWGQ FAKDNQLEKA NNGSMFAPKT  780
PHFFKIILEA TIRDKKLGIP RKFVKKYGKG LSSSVLLTVP SGDTWHAQLT KSDGVVWFQN  840
GWQAFAEYYS LQYGHFLVFR YEGNGKFLVL IFDMSASEIE YPCKSHIEDH NSDDQVCLKL  900
VKKEAKDDTC DGTLYETPPC KETRKKKKKK KRSRPPCSKP RKKLKITQKD KNEKDWEDES  960
TREEDMQTKV PRDEHAFGVT EYDKALQRAS SFRSENPFFL VIMQPSYINP GRKMCIPKEF  1020
TMKFLKENLG DLTLCTSEGK TWSTQYWRYI SRNKYTKAII HIGWRQFMLA NNLEAGDVCV  1080
FELISQTESM LKVIIYRVRQ DTSCSSPLGG INSSENGGNI NSSTLGSTES NHDCLMRPMT  1140
PVEKARAILK ASNFKSKNPF FKVVMQPRYL ILRCSLGIPY KFVKRYLDEE KEEAILQVSD  1200
GRTWVVKFAV KVFTGGQHKA EFSTWRAFAR DNNLEVGDVC VFELINRHEN SFKVSIFSAA  1260
PGANSSLSSQ ADDAEASQVA SKNCLVPRIE ADDDFGNCYA GNSGPAAQLT TIEYQENEEE  1320
VNLTDDAIAS QVASKDWLVP KIEADDDFGK CHVGNSSPAA QFPAIGYQET EEEVQPTIST  1380
RPRGPQRLQA REKAKALQRA SGFKSQNPFF TVAMQPSYVS NGYRLAIPLD FSRKYLRNGS  1440
GNAILSTVGD GRTWLTKYHR EAKGTNPRAK LIDGWKTFAK DNNLEIGDVC VFEMINSEGY  1500
QLSLNVAIYK PQEDQT
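
The sequence above is displayed in groups of 10 residues, 60 per line, with a running position counter at the end of each full line. A minimal Python sketch for turning such a block back into a plain residue string is given here; the function name `parse_numbered_sequence` is hypothetical and is not part of PlantTFDB or any library.

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Join a grouped, position-numbered sequence block into one string.

    Assumes residues are written as letters and that everything else on a
    line (spaces, trailing position counters) is layout only.
    """
    residues = []
    for line in block.strip().splitlines():
        # Keep letters only; this drops spaces and the trailing counter.
        residues.append(re.sub(r"[^A-Za-z]", "", line))
    return "".join(residues)


# Last two lines of the sequence block shown above.
tail = """
GNAILSTVGD GRTWLTKYHR EAKGTNPRAK LIDGWKTFAK DNNLEIGDVC VFEMINSEGY  1500
QLSLNVAIYK PQEDQT
"""
seq_tail = parse_numbered_sequence(tail)
print(len(seq_tail))  # 76 residues: one full line of 60 plus the final 16
```

Applied to the full block, the length of the returned string can be compared with the stated length of 1516 aa as a quick integrity check.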
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
4i1k_A  3e-16   443          1504       18         144      B3 domain-containing transcription factor VRN1
4i1k_B  3e-16   443          1504       18         144      B3 domain-containing transcription factor VRN1
Nuclear Localization Signal
NLS
No. Start End Sequence
1 923 931 RKKKKKKKR
2 923 944 RKKKKKKKRSRPPCSKPRKKLK
3 925 931 KKKKKKR
4 927 942 KKKKRSRPPCSKPRKK
5 928 943 KKKRSRPPCSKPRKKL
6 929 942 KKRSRPPCSKPRKK
Annotation -- Nucleotide
Source Hit ID E-value Description
GenBank JX616762 1e-150 JX616762.1 Gossypium hirsutum clone NBRI_GE61640 microsatellite sequence.
Annotation -- Protein
Source Hit ID E-value Description
Refseq XP_017635905.1 0.0 PREDICTED: B3 domain-containing protein LOC_Os12g40080-like
TrEMBL A0A0B0P1A8 0.0 A0A0B0P1A8_GOSAR; Uncharacterized protein
STRING Gorai.008G071300.1 0.0 (Gossypium raimondii)
Orthologous Group
Lineage Orthologous Group ID Taxa Number Gene Number
MalvidsOGEM16925512
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT3G18990.1 8e-47 B3 family protein