PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pav_sc0000206.1_g620.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family B3
Protein Properties Length: 972aa    MW: 106908 Da    PI: 7.228
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pav_sc0000206.1_g620.1.mkgenomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B357.72.1e-18411511199
                                EEEE-..-HHHHTT-EE--HHH.HTT..---..--SEEEEEETTS-EEEEEE....EEETTEEEE-TTHHHHHHHHT--TT-EE CS
                         B3   1 ffkvltpsdvlksgrlvlpkkfaeeh..ggkkeesktltledesgrsWevkliy..rkksgryvltkGWkeFvkangLkegDfv 80 
                                f+kvl+ sd+++ grlvlpk +ae++   +++ e+  ++++d +g++W +++++  +++s++yvl+ G     ++ +L++gD+v
  Pav_sc0000206.1_g620.1.mk 411 FEKVLSASDAGRIGRLVLPKACAEAYfpPISQSEGLPIRIQDVKGNEWMFQFRFwpNNNSRMYVLE-GVTPCIQSMQLQAGDTV 493
                                99**************************555556677***************9989999999***9.***************** PP

                                EEEE-SSSEE..EEEEE-S CS
                         B3  81 vFkldgrsefelvvkvfrk 99 
                                +F++++  +++lv++++++
  Pav_sc0000206.1_g620.1.mk 494 TFSRID-PGGRLVMGFRKA 511
                                ***887.777788877765 PP

Sequence ? help Back to Top
Protein Sequence    Length: 972 aa     Download sequence    
MTSRFQDLGF AGVEFGIMGS RICMNVLCGT TNTHEWKKGW PLRSGGFAHL CFKCGAAYEK  60
LVYCDKFHAG ETGWRDCSLC RKPLHCGCIV SKSLYECLDY GGVGCISCAK SSQPRVIQND  120
DVLNGFGGLK ISNYSDRQST VVQNGAFSNT VDEGKLLQLC KIMEANESNL LPQPQRGDIN  180
VSLVQKKQEE VINHNGEVGL GFPSTTQPSI GSLTFSKSDN GRTMIEDINK SSSQPSLSMT  240
LGSPSATPSF VQPFPGGLVD GREQSKTPSS FQQGHGREQS MTPSFQQGLV DGREQSKTPS  300
SFQQGLVDGR EQSKTPSSFQ QGLVDGREQS KTPSSFQQGQ KSRPILPKPL KPSVTMSSET  360
NKGGFPNVRV ARPPAEGRGK NQLLPRYWPR ITDQELQKLS GDLNSAIVPL FEKVLSASDA  420
GRIGRLVLPK ACAEAYFPPI SQSEGLPIRI QDVKGNEWMF QFRFWPNNNS RMYVLEGVTP  480
CIQSMQLQAG DTVTFSRIDP GGRLVMGFRK ASKSLDMQDP QKSMLPNGST PGETSCPNVV  540
ENPATGSGHL GLFQTNTGSK DPHLHALSEH LHLTDGDMSL HKNDYHGHRT SEDLLQQPVL  600
NSDKKRARNI GPKSKRLLMH SEDVLELRLT WEEAQDLLRP PPSVKPSIVT IEDHEFEEYD  660
EPPVFGKRSL FTASSSERQE QWAQCDDCSK WRRLPADVLL PPKWTCSENS WDTSRRSCSA  720
PEEMSQKDLD SLLRASKDLK KRRIIENCTE AQEHEPSGLD ALASAAILGD NVVDSGEQSV  780
GATTRHPRHR PGCTCIVCIQ PPSGKGKHKP TCTCNVCLTV RRRFKTLMMR KKKRQSEREA  840
ENAQKDNNNH KDESEINGPS TEVGLHMNHS SENGGCQSRI EADVAESSSA GQIDLNCEPN  900
PYVQASGLTL LRLADAASQP LNNFMKESCL TNMMCEPKAG IGSSLLTQAT DESERRLSAV  960
AAWDCEGRGD AD
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1821832RRRFKTLMMRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G32010.10.0B3 family protein