PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GAY33839.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Rutaceae; Aurantioideae; Citrus
Family MYB_related
Protein Properties Length: 652aa    MW: 75455 Da    PI: 8.9749
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GAY33839.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding26.61.3e-08446485345
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45 
                      +WT+eE el  +  +++G++ Wkt a t+g  + + ++k+ w+
       GAY33839.1 446 KWTPEELELVRKFYEKHGSD-WKTMADTLG--KHRFHVKDAWR 485
                      7*******************.********9..56677777776 PP

Sequence ? help Back to Top
Protein Sequence    Length: 652 aa     Download sequence    
MGKRTTGKKN KDDETDADTL EEEVINSRIR HEVSDNNVEF RATPNDVVAE DDVDVKRQKK  60
NKKKKRDEIR DDDVELCAVS KNEVALRDWK KRKNKQRNED GVNDAGLREG RIEDLVESNA  120
DEAESEKKEK KKKKKLLEDG SDGAESSVNQ SKAVIDIDVH VHKEKKKKKK SKKRQEVSCD  180
DDEFGSTINE TVVEGDVDMN TENKKDRKKK KKKKKKRKLE IKEDETVLEG DVDMNTENEK  240
DGKKKKKRKL GINEAKKNKD ERFNNEDEVS GVLNQGLKHH LELDENTSLH KQKNGLEEEG  300
ENNKKKKAMS MGKHSGGDKK VSRTKKGVKP NDPSESSAHK ERPKKVSFSD HVQVVPSSEA  360
KSDKNDGFVR GKRFSLEEDE MIKKAVINYI EAHRLGEDGL NMVLHCRSYP EIKHCWKEIG  420
AALPWRPCES IYYRAHILFE RDENHKWTPE ELELVRKFYE KHGSDWKTMA DTLGKHRFHV  480
KDAWRRIKLP NQKKGQWSQE EYQKLFDLVN MDLRMKASEE KRTKHGMLRD NISWEAISEK  540
LSTRTNAICC MKWYDQLTSP MVAKGKWADT DDFHLVNALS GLDACCMDDV DWDNLLEHRS  600
GTICRKRWNQ IVKHLGTDGN KSFPEQVEIL STRYCPDVLE ARLAYNSKGT TV
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
15693KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
25996KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
36293KKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
46394KKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
590127KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
6127135KKEKKKKKK
7127164KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
8129136EKKKKKKL
9130167KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
10131168KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
11132169KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
12165173KKEKKKKKK
13165202KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
14166203KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
15167204KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
16169206KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
17204219KKDRKKKKKKKKKRKL
18204212KKEKKKKKK
19204241KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
20206212DRKKKKK
21206213EKKKKKKL
22207222KKDRKKKKKKKKKRKL
23207215KKEKKKKKK
24207244KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
25207213RKKKKKK
26208216KKEKKKKKK
27208245KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
28208214RKKKKKK
29209217KKEKKKKKK
30209246KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
31209215RKKKKKK
32210218KKEKKKKKK
33210247KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
34210216RKKKKKK
35211219KKKKKKRKL
36211248KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
37211217RKKKKKK
38212220KKKKKKRKL
39212249KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
40212218RKKKKKK
41213218KKKKRK
42213250KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
43214219KKKRKL
44214251KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
45215246KKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
46215252KRQKKNKKKKRDEIRDDDVELCAVSKNEVALRDWKKRK
47243251KKKKKKRKL
48243249RKKKKKK
49244249KKKKRK
50245250KKKRKL
51304329KKKKAMSMGKHSGGDKKVSRTKKGVK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G41020.11e-131MYB_related family protein