PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID kfl00212_0050
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Klebsormidiophyceae; Klebsormidiales; Klebsormidiaceae; Klebsormidium
Family MYB_related
Protein Properties Length: 3102aa    MW: 327093 Da    PI: 8.1491
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
kfl00212_0050genomeKFGPView Nucleic Acid
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding23.81e-0722302272446
                       S-HHHHHHHHHHHHHTTTT.-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding    4 WTteEdellvdavkqlGgg.tWktIartmgkgRtlkqcksrwqk 46  
                       WT+eE +++ +  ++ G++ +W+   + ++ ++tl q++ ++q+
    kfl00212_0050 2230 WTQEEKDRFGEMMTRQGKKkDWEQLQEAFP-NKTLQQLRTHFQN 2272
                       ******************************.************9 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.64E-1215371592IPR009057Homeodomain-like
PROSITE profilePS5129317.48915411592IPR017884SANT domain
SMARTSM007174.3E-615421590IPR001005SANT/Myb domain
PROSITE profilePS512935.33117851826IPR017884SANT domain
SMARTSM007172.217862276IPR001005SANT/Myb domain
SuperFamilySSF466892.47E-622252274IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.607.0E-422302279IPR009057Homeodomain-like
CDDcd001670.0025722302274No hitNo description
PROSITE profilePS500906.41122302274IPR017877Myb-like domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 3102 aa     Download sequence    Send to blast
MPGNEELPVW GGGGRGHGPP YGRDRGERID RVDRGRRDFP ERPPGFARNV APIAPPQGPV  60
PLKRRGPGRP YVYGEARYGP PGMRGAEVGP APFLEGGPPP PAARRPERDA FYAAGSPPYG  120
PAGDFARREA GRDVGYGPEV PGEGPVARGA FYRERGDAVA RDLDGGPGWQ GREPPPPMAL  180
REFRERERVR SLDRGYRRPG FAAAPYGPPP PRDNRVDRFG REWLPDTTRR PPAASDDLPP  240
VDWKSLAELP SPREGSGGRD LPPREVGGPG LVAREVGPGF REGLGPRDGG FTTQEGPLTA  300
RDGFGTRDAS VAPPVPEPLY ARREAAVGSR DGFGGRDLGP LSTREPPFGL RDGGAALKEG  360
PYFPRERPGP SPWRTSMASP PGSVPGKGGL RDREDFPPRR EGPGERIDFN DQPGPAERRG  420
DERPALFERL AERPGRAPDR TDRSPGVFIS PPPHPEYPPD KPALDKVSPR GLSSAKGSPR  480
SENRPSPRSS EAAASPRRPT EKAPPKDSSL ADRAAGAAPP VAAPVLSVAP LVASIAPFSS  540
PSPLEPSMSG QSSEARAGER PRPRLGWGQG LVAFERVKPK DEEEQVSSAR RSSGADAGPE  600
SVGVSPAVSE SKSVAHTAEP EGGVVNSLQM HKDEGKEAPP AEAAEAESER ARVEASPATP  660
DSGKLVDLLL KTAVSEALAL DSEAGAPKAQ PSQAPVPDPQ APTIKAEAVT PKVGKSSPWP  720
QSLSALSEKE VPPKTEPPMT WPQSLSALTE KEGPVKPREA APKEGPVVSV DMGPATVEAS  780
PAVKASPWPQ SLSAAMGLVS APEGLAPIRT PRLEEQSPAG QVHSDGNSIF DSLLHRAQAS  840
DAGAAKPVNP SDSIPTSPLS PTFPQPSKEE PPAFPLSAST QPMTFSSLVG SSLPEPSHRS  900
PFGDAARLTS PYQDGPRRSY LETRPYGASP GLFSTGFHRS DSGSLGLGGA AWKPEDYLRS  960
DAGFRKDGAV EAGEGSLGRT PSEAVNLKEA VLASVGERGG LESGQKTPPP QARVEAAEKT  1020
EDQSSPMQVD GPVVAEGVAE GERKVEETEE DGPAATEKPG EGAAAEEVGT GEPSKDVILS  1080
EMEQVDSEIE RLERQLSRMA SLEKSGALAS TPTAQALAEA TGIVGVQAEA MKGSQEAEGA  1140
TTAEGLSQET GIEGVAGSKQ LGGTGTEKEE APTEEEGAEK GGEAAAEEGP GKGRTPKASE  1200
GPPSKAGSLK WKWAVAGNGD LSPDTGAGKQ ETGAAAQEVP EAGEEAAAKP EGDERGTEGA  1260
AAEVKEPAAA LQAKRDFDPL AMGGPDPFLK FEQIIASQDG LARYLTEQHP PPPKAVPKLE  1320
PKADELILSA GCQDRAVFVQ ARIWANREAA RRSEAGLQHL LPAGCQLGDG PLYTCPEEAP  1380
YWQVNEEAHA RIEEVLRARL QEKKRDLAFT ERVLALRYRF LRALWRQELE GVPARPALGK  1440
ARETPRLSKD KAIRRMESGS DRKAADAKAP PKSSQRFRPG GKLDAEELQI VSRLLAEPGG  1500
EVRGHLKMTP MILTDEERAA HKFVTRNALV RDPVQEAALH KLVNPWTDAE RKLFLEKYTV  1560
YNKNFKKIAS FLEHKSTADC IRFYYLHQKS EDFDKVRRRQ QLKKRRDYRG NNSFMGVNAL  1620
AGTTGRRDRE ANAARVEALP ALTAVMKESS KAARRAMDRG KPPSMPADVE RGSQSKVAAA  1680
AVSAVVRADR GGLLDPISGD APPRKKKRDK LKFRGGPPRP EQSEGPVGGE ANAEPGPASE  1740
GVIKRVSSKG ARSNLGRKER PWLDTGTPTG AGSEERVLGL ASPEDPALQW TDLERQTFVA  1800
AIREYGKDKS SAQCKAFFSK SRKRLGLDRL LEERSLKGTP GAGPEDGGAA NNGAAPEVGT  1860
GEKVGVASES DEDRTIAEVA SIKGQGSADK GALEVAGGAA GGAKEAAPFG KKVSNEELKR  1920
ELQLGLAKVQ EKAAILGRKE QEVKASAAPP TKPTLVVKPP AGVAPGVQKS PTSLPGGADD  1980
ELRAAMTLLE AKEGLSDSAD EMEMRPLGEV LKRKGSLVGA GGDAQRPKKK KKVEKTVGAA  2040
EAGGPPKKKA GKPKGEEAGP GSQGVEPKPM KPKPKKSGEG KASSVKGERS GKAASEDLEG  2100
SVHGAPSLWR AVLPKEEGGA ADGHEMSLMA TEGSLSCSDK EGDEAGEVSA QARPAKGPEK  2160
KAKPRTKKAA PEQDVPEAET FTALLAGGDS DESSGEPILG PGKQAPAAAA AGKKRDGGDK  2220
TPGERKIAVW TQEEKDRFGE MMTRQGKKKD WEQLQEAFPN KTLQQLRTHF QNSKAKRLKE  2280
LEREAAQAEK DKERHKDKDK ERDKEKEKEK EKERERERER ERESTPTPAP PARALAATPP  2340
LPEGLREVFS PDQVQQLQKL QAHLESERAA KLLAATFDTS KHSEPPSSRE VAQPGLAGSA  2400
SLTIPLPGRV RPMTPPPAPS QAHPAPGSRL PPQKKRKLED EQPEMGGINA GPPRQEAAPS  2460
FAATPSFLGK GAIRGEEQPA RSAGGSSGFA ALLQGAGPGG GSGGGGGGSM GGKPAGLANP  2520
FMQPSARLPV DVSEQHRQLA LLMQANGMGG PAAFPPGMFP PFLFPGLMPM LMGGLGQAGI  2580
PPQMAVAAAA AAAAGGAPPR LGPFGGPPRR PSEQELQHSL EVQQQMGQRL PDDYMAAALR  2640
SAAGQAAPGG QGQRDAGLRE RPNSSDVGEG RGPSSKGRST PLDDMRSLSP PPLQTMRSSQ  2700
QHELHEQEQP RPTAAPPPAS SLRASSGSIK LFGQSVGPQS SPPQQPLRPA VHAPVPRSSE  2760
GFRGPEGLSL ERPPPPHSLH SARDGLDRRG SDERKEDGRT EPPFWAASAS GGGTGWGRPE  2820
GGPSALPHWM VEQRPTEHEQ GGGPRAREEA NEGRSSRGVE GAASPMSGSS LQEAMARLLA  2880
LDPRAALPDR DRDGGRHGAE ASGDKKGHEG MRMASDGGTR SKSEASGMLR DRAGQGGERR  2940
PLEGTAPGNR GGGREMGGGG GGLALSDGPG GISMDNMRTA YEAMAAAREA GGPAQDGGGA  3000
RDMPRNVGPG SLPPWMQQGP VPEGHHAAVR PDMFFPNVGM AGWPGAMPQH PAAQAQMALM  3060
IQTFALQQAL QQQQQQQQQQ QRQYRPSDSR EGREQGPGNN AP
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C3e-1515051593593NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D3e-1515051593593NUCLEAR RECEPTOR COREPRESSOR 2
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
117031709RKKKRDK
217031710RKKKRDKL
320112031KRKGSLVGAGGDAQRPKKKKK
420262032PKKKKKV
529512959GGREMGGGG
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
TrEMBLA0A1Y1I6Z30.0A0A1Y1I6Z3_KLENI; Uncharacterized protein
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
Representative plantOGRP32151725
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.15e-26MYB family protein