PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GAY41862.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Rutaceae; Aurantioideae; Citrus
Family MYB
Protein Properties Length: 1450 aa    MW: 163232 Da    pI: 6.5337
Description MYB family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
GAY41862.1       genome    NCBI      View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  40.9   4.8e-13  1158   1203  1          46
                       TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding    1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46  
                       +g+W +eEde l++ vk++G+++W++I ++  + Rt+k+c++rw +
       GAY41862.1 1158 KGPWKAEEDEVLINHVKKYGPRDWSSIRSKGLLQRTGKSCRLRWVN 1203
                       79*************************8887799**********87 PP

2    Myb_DNA-binding  41.1   4.3e-13  1214   1253  3          44
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrw 44  
                       +++ eE+ + +++ +q+G++ W++Ia +++ gRt++++k++w
       GAY41862.1 1214 KFSLEEERIVIELQAQFGNK-WARIATYLP-GRTDNDVKNFW 1253
                       799*****************.*********.*********** PP
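The alignment blocks above appear to follow the usual HMMER layout: a consensus-structure line (CS), the profile consensus, a match line, the target sequence, and a posterior-probability line (PP). In that convention a letter on the match line marks a residue identical to the consensus and a `+` marks a positive-scoring substitution. The sketch below tallies those symbols for the first domain's match line. The function name `summarize_match_line` is hypothetical, and the match line is copied without its leading indentation, so treat this as an illustration and not as part of the database's own tooling.

```python
def summarize_match_line(match_line: str) -> dict:
    """Count identities (letters), positives ('+') and other columns in a match line."""
    identical = sum(ch.isalpha() for ch in match_line)
    positive = match_line.count("+")
    return {
        "columns": len(match_line),
        "identical": identical,
        "positive": positive,
        "other": len(match_line) - identical - positive,
    }

# Match line of domain 1 (target residues 1158-1203), leading indentation removed.
domain_1_match = "+g+W +eEde l++ vk++G+++W++I ++  + Rt+k+c++rw +"
summary = summarize_match_line(domain_1_match)
print(summary)
```

The column count should equal the aligned span (End minus Start plus one), which gives a quick check that the line was copied without losing spaces.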

Sequence
Protein Sequence    Length: 1450 aa
MARWDEILSL PVQNPPTLEF ASVDLVWSKV EGWRDKLDRV ALIPFARVDD FVRGESSNKD  60
CPTRFHVEAR RRRSTSTSCK PKVDGILEYI LYWCSFGPDD HRKGGIVRPS RTTYVPKKKN  120
AGRPNTKRGC TCHFIVKRLI AEPSVALIIY NDEKHVDKNG LPCHGPQDKK AAGTRAMFAP  180
YISEDLRLRV LSLLHVGVSV ETIMQRHNES VERQGGPCNR DDLLTHRYVR RQERSIRRST  240
YELDLDDAVS INMWVESHQS YVFFYEDFSE YDPFTLGIQT EWQLQQMIRF GNRSLLASDS  300
RFGTNKLKYP LHSLIVFNSD KKAIPVAWVI APSFSSADTH RWMRALYNRV RTKDPTWNLA  360
GFIVDDPSAD CSVLVSFWRV RHAWHKNLVK RCSEIGMRAE IFRCLGVAVD DICKGHGTIA  420
LFENCMEDFM DGSDFMDYFK AVWYPRIGAW ITVLKTLPLA SLETSAAMEF YHNQLKVRLL  480
NEKDSGVYQR TDWLVDKLGT KVHSYFWLDE YTGKDDFARY WKDEWVSGLT CWRKALKILD  540
SDVVIEGRCG KVTDQLDGNK VYVVRNPGSQ FGICNCSWAE MGYLCEHLLK YNKALMDMLH  600
CTPHDSLIRD HAISLAVSIQ KQLNASVDFE SSQISVASVE KQIVETNEQQ TVGTFHADQD  660
RELVNEGHCV NDDVSSQKGR NRGEELVASG GTANELAGGL INQLVSANSL CGGTTEEEIS  720
FAKTDVEQSP IYISTPGLVS VDELASSGGF SKNEQRALVS DAEISGYTHS KDAAVTDQNE  780
AEEGISDKDC HQDLDVEPFT IDMPPPTMEF LEQCTVSPQN GISSLDPQLP VLSNKADADS  840
HSDKASRPMY MPVESKAVGV SETAGIVGDN ENEVGNAKGG AAKSPCSTDI ALVSDGSCDD  900
AANNSNTCHN ANGVQSVMPS ESSRNHMSIP SDTENQQAVG VVSQKINSCS VPLGDKPSVN  960
VCELKRNAEE GDCDNKLTVT KKAKTENEPA SLKLEKHHTD AGTSSVYCNI IRPTPRFLLS  1020
DVVPDKILHK RRKGNGVIYF YLYEIDELKN KHSKRKLVRA RVRVRVRVRV RLGHENDEID  1080
RKIKSSRGIC VILSTSFECE KKLGNVGELV ERCGGNRLNR RLLKNKEKER RVEEEVSAIE  1140
VGELRVEAME GKREEIRKGP WKAEEDEVLI NHVKKYGPRD WSSIRSKGLL QRTGKSCRLR  1200
WVNKLRPNLK NGCKFSLEEE RIVIELQAQF GNKWARIATY LPGRTDNDVK NFWSSRQKRL  1260
ARILQNSATT SCSNSKSYKT KREFPASPDA STLEAPKFTS SMEEESSSKR QSCLSSNMMG  1320
NAELIAMAPL PEMINPKLLH FGASCENNPC TEPVLPIYFP QISQPQQDLP FSPESQELLA  1380
RLEDPYFFDV FGPVDAPPEL SGQPFLKPET SCRTGMKDEN DKTVNPDAFF DDFPTDMFDH  1440
IEPLPSPSDW
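The Start and End columns of the Signature Domain table are 1-based, inclusive positions in this sequence. As a minimal sketch of how to recover a domain from the blocked listing, the code below strips the spaces and running residue counts, then slices by coordinate. The helper names `parse_blocked_sequence` and `slice_1based` are hypothetical, and only the two listing lines covering residues 1141-1260 are embedded, so the coordinate of the first residue is passed explicitly.

```python
import re

def parse_blocked_sequence(text: str) -> str:
    """Join a blocked listing into one string, dropping spaces and running counts."""
    return re.sub(r"[^A-Za-z]", "", text).upper()

def slice_1based(seq: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive); first_pos is the position of seq[0]."""
    last_pos = first_pos + len(seq) - 1
    if not (first_pos <= start <= end <= last_pos):
        raise ValueError("coordinates fall outside the supplied sequence")
    return seq[start - first_pos : end - first_pos + 1]

# Two lines copied from the listing, covering residues 1141-1260.
excerpt = """
VGELRVEAME GKREEIRKGP WKAEEDEVLI NHVKKYGPRD WSSIRSKGLL QRTGKSCRLR  1200
WVNKLRPNLK NGCKFSLEEE RIVIELQAQF GNKWARIATY LPGRTDNDVK NFWSSRQKRL  1260
"""
fragment = parse_blocked_sequence(excerpt)

# Coordinates taken from the Signature Domain table.
domain_1 = slice_1based(fragment, 1158, 1203, first_pos=1141)
domain_2 = slice_1based(fragment, 1214, 1253, first_pos=1141)
print(domain_1)
print(domain_2)
```

With the full 1450-residue listing pasted in place of the excerpt, `first_pos` can be left at its default of 1.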
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1030   1056  KRRKGNGVIYFYLYEIDELKNKHSKRK
2    1059   1069  RARVRVRVRVR
3    1061   1071  RARVRVRVRVR
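Entries 2 and 3 of the NLS table overlap (1059-1069 and 1061-1071). When counting distinct predicted regions, it can help to merge overlapping hits first. The sketch below does this with a plain sort-and-sweep over the coordinates. The function name `merge_intervals` is hypothetical, and merging only strictly overlapping hits (not merely adjacent ones) is an assumption made for this example, not a rule stated by the database.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) pairs with inclusive coordinates."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# Start and End columns from the NLS table.
nls_hits = [(1030, 1056), (1059, 1069), (1061, 1071)]
nls_regions = merge_intervals(nls_hits)
print(nls_regions)  # [(1030, 1056), (1059, 1071)]
```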
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G60460.1  9e-94    MYB family protein