PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cre09.g411600.t1.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas
Family MYB_related
Protein Properties Length: 2173aa    MW: 218998 Da    PI: 7.2617
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cre09.g411600.t1.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding31.54e-103877142
                        TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHH CS
     Myb_DNA-binding  1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcks 42
                        + +W+ eE +ll++ ++++G++ W++Ia+ ++ gRt+ ++k+
  Cre09.g411600.t1.1 38 KPPWSSEEFLLLARWHAEMGSR-WAAIAKLLP-GRTESDVKN 77
                        579*******************.*********.********8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466894.06E-103283IPR009057Homeodomain-like
PROSITE profilePS5129411.1263387IPR017930Myb domain
Gene3DG3DSA:1.10.10.602.6E-113483IPR009057Homeodomain-like
SMARTSM007177.6E-93785IPR001005SANT/Myb domain
PfamPF002492.0E-93877IPR001005SANT/Myb domain
CDDcd001672.14E-74077No hitNo description
PROSITE profilePS500903.892889911IPR017877Myb-like domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 2173 aa     Download sequence    Send to blast
MGDARSVGGG HSNELIAGLY PRPAILAPSD ATDRRVSKPP WSSEEFLLLA RWHAEMGSRW  60
AAIAKLLPGR TESDVKNLFY GTQRAKRAPR SQFLFFYVQA LGPPDSRGAD DHATRQAAFA  120
AAKFALEAGQ LRLEGAVEPE VVQRLLGPPG PGTSLRQAAA AAGGDGSVDL PPINEWLTSA  180
AAAAAAAAAA AAVAERADPA AAAAAAAAAA AAARAECGGG GGGGAATAGV RGQAPAGFRS  240
VAGSGMDGEA GAALVAAAAA AATAAAIASR GGGRAAAPAG GAAKPGSGCG GGGGGALPLA  300
VAGGGLATRN PRALPPRRIQ LMPEDAAAAG AAMVAAAAPP AVAAAAAAAA AEGGRGEAGW  360
SEPPLPAAAA AAAGMDMAGA ADSPSSSASA SATIGGIPAG AGCHRPGAGR GLDRLAAVAA  420
AAAAAYASSR APMTLDLSRT APLNLMMDLN LGGRGGGGRG EGGGGGADSD GSPAGGGGCA  480
ARAPPLRSGG SQAGPVGYAA AGAPAPGDGG PPSASASAVA AAAAAARAGR SGRHGQPLPS  540
FDDGPDTGAG TVRGQPTAGG GGGAPDQWPA GGGGESPQGR ARDPPPHSDE ADWGNGNGNG  600
GSGGGGGGGG GGGGGGGGGG GGGGGGGGGG GWRHSRAPHG PGVSGPGRHH QHHQHHQHSE  660
SPSATSLTGS LVRLDLSGQQ PAEPHHPHQH HQQHQQPHRQ QHQQPRQQPQ PQRDSPQRDS  720
PQAQSWPAAR LLGEGPQGLD ASPSAQAGSG GGGGGGGGVG QDSGGPNSGP ASGGPGSKRL  780
PPMLLQHGPM AAAALSDGGF GGGGGGGGGS AGDTPVPAHM RNDGGGGRQQ YGERQAPPSQ  840
SQPLPARVAA GAAAAAAAFG AEGRREQCVG HPHPQQQHPH QQQGPGKTWT DDETRRVMEA  900
ARRLHKRQAW EDEMVTAGVG ERRRQPPPGS QCGAGGVSEG GEWAGQAGAE GSRANYSGCR  960
AGAGPPSAAA AIGGSREGVG GGAWERRPPG GLGPSFGRPG PYGPMSADVL MAPADATAGA  1020
AATAGAAATA MPRRGVATVG GYDERMGECD DRQGCSTAVA YRALQQPQQP QQQQQPQAQY  1080
RAQHRTQQAA MQQQQPQQQQ HSQQQQQQGA WGRGYGSPGG RPSPLARTAA AHGAQPQPQQ  1140
HQQHRPPSLH TASILEGHAE PPPCGSGTPY DRPAGVARAS MDGAVPRTMP YGDGPASAPP  1200
DHIRYRYSPH ATAPGGSGGG VGGGGGAAAT RYSHGAVTVK QEPAPYTEAE DRLYSSSYNC  1260
QQPPLAEPLI AAGQQQQEPV PQRYPQEQQQ QQQPQRRQPY TSTATCAGRR HAHAPYGAPN  1320
SGPEGPCTAP SAYGAGNGDD GDYLPPRTVS VAHLAPSDPE TADAGSFVLY RTAPGSGREH  1380
AVVPPPPHPK PPPPGSAAAA LMAAAAAAMA ANPYNSSGPS SGPSTGRSVA PQSGRGAACA  1440
DAVAAPAAHP PAYYAEEEEE QEEDQWIARP PHQQHQQHQQ PGLYGRADAP RGPPSGAAGA  1500
YAPQQRQHHH HHQRPPQRQQ HHHHQQAHQQ PMPRQGSYGP PPQQQPAAEG RGEWVLRDRR  1560
PSQQPNPSPF RRDPYGEPPF QDVPSGGGGG GGGWDAAADG AEEEYPEEVV AESQPNLAPP  1620
LPRAASAAAV PDAYEAQRHS QERAGYCSAN AARYANAGRQ AAQRCPQQQQ QQQQAYWPAA  1680
PPPAQQQQSH GPSRFQFRDP NGYSGYPCDL DPTGLQPPQR QPQPPPPPPQ QLRPYASAPQ  1740
PQPPHQPPQQ HQTEPQVYIK QQPYDEVDED YGRPVRPTGH FVHAQSGSMP SGGGGGGGVP  1800
YAPVSYRDGN DYAPPGEAGN GSGGGGGVGR YPPPVRPWHV PVPLPLAQAE AERGRYQPPG  1860
GSRQRCYEDD YMVIDGPASA PPRTYAAAAP AAPSSGPWAA TGGGGGLAAA AAARSRPIAL  1920
GSAPWPSQPP PPPLYEVEEV VAESSVAGDL APPPPPGPRS AQAFGRSSWR GVEGAAAGGG  1980
GRSGEYPSAA AAVGASHGTG CGGDGAGGGC CGGYPGSYAG DGEGSGMVMD VAPPAATADT  2040
EDDYNGDGYG RPQHRRYHAP RPDAEQQLAA AAQHSRPQPQ YRQSHQQQHP HQPYTQHQQR  2100
AARAPAGAPP PAQHPYQQHQ HQHPHHHPQP HSQQQPQQQY QQHELQQYRP VNDGRFGDGD  2160
DCGASELDRS SL*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1456464GGRGEGGGG
Cis-element ? help Back to Top
SourceLink
PlantRegMapCre09.g411600.t1.1
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
TrEMBLA0A2K3DFM40.0A0A2K3DFM4_CHLRE; Uncharacterized protein
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G18770.12e-09myb domain protein 98