PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cre06.g264400.t1.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas
Family MYB_related
Protein Properties Length: 3030aa    MW: 298093 Da    PI: 6.7362
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cre06.g264400.t1.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding28.43.9e-0921982241348
                          SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
     Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48  
                           W++eE + +++++k +G + W++ a+ ++ +++  q+k ++++y+
  Cre06.g264400.t1.1 2198 YWSQEEKDVFLHVFKSYGRD-WARLAEAIP-TKSTSQIKTFYHNYK 2241
                          5*****************77.*********.*************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466898.22E-1115941649IPR009057Homeodomain-like
PROSITE profilePS5129313.8415991650IPR017884SANT domain
SMARTSM007174.5E-516001648IPR001005SANT/Myb domain
PROSITE profilePS512938.38218581906IPR017884SANT domain
SMARTSM007171818591904IPR001005SANT/Myb domain
PROSITE profilePS5129311.20321942245IPR017884SANT domain
SMARTSM007171.4E-921952243IPR001005SANT/Myb domain
SuperFamilySSF466891.35E-1021962245IPR009057Homeodomain-like
PfamPF002493.6E-821982241IPR001005SANT/Myb domain
CDDcd001674.88E-721992241No hitNo description
Gene3DG3DSA:1.10.10.608.6E-721992241IPR009057Homeodomain-like
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 3030 aa     Download sequence    Send to blast
MDRGPPRAGP GGHRPPYDGP TSPREFYGGP PRDGRDFRRP SFSQDRGFAS PVGRAGNFNG  60
PVSPDRHRGY SPERGRPYSP PDHDRRGFSP ERNVDRDRGP RFQMQERPNG YRGERPRDEG  120
FRRDNYGYRG GGGGAGGGID EQGRPPVRSR DPSPDHHHSH YHSTRDQPIR ERGHEVDQHG  180
PRPTSREGTP LDNWGKSGAG TGGLSSHGSH GMGGSRGGGH RTNTAAVSLG PAAGGGGGGG  240
GAGSSRQGAP SGPLSVAAPS GDFADTGPSS NGDLEAGEVL PGPTPGSRGG EQHTRPSPEH  300
HNSGHHHSSH HNSHGHHHHH HHHHSSSYHS HGSHQSGGPG GGGAGGGAGP GPGSTSHHHS  360
HHGHRSSSGP PGGGTRGGSS GSWEAKGAGG GGAGANQRDG PPLREASSRD VRGSEHGGRD  420
SRDGGGREAG RDAGGSSGRE AREGSRPPQR TPSLTGAGGQ QQPTLRDGSG PGRPELVTRE  480
GSVPVRDHRD RDRDRDRERE REHSAGKGRE PGRSSSQGGG GGPPSPGQVG REGRGPADGS  540
EPRWSSGGGG LHHAEQRRVS SGAVPGLEAA DRRPSNTGAE LDRRGSAHLA SLDRRASGSS  600
SLGQQPSASE AQQPRVSAGG GGDAPGPNDG GAGGGGGGGA AGPAAGVDDH DCEPGQVLLP  660
PPQAQPTSGS GLDRRPSGMT GGSTGAAAGA GTGAGLHAVG SVNSLSAAPG AAAASVEPEP  720
GEVMSDGEVE PEQEAAERAD SGQLDPGQIP PNELESGAPT DPPPPAAEAS NAAVARAATA  780
AAPAVEKAGA PADPFAGPAA GATAAAAAGG GGADAVAAGS REEDAAVPSA SGAAAVSSDM  840
ERAASGGAGG LSERQPSLTG GGGGGGGPER APSLSAGLGG AGGGGGGEEG MGTERKSSLS  900
IRRFGFGRAR RSLPAKSAGE GIPEEGEAAG LDRAPSASAP RVSQPGDAPE AGEAGSAAPG  960
ATAGGDAVAA SGAASGEPAG LGTGGARDLL SSLPPILTIP TPAGALPSAA PTPGTALTPG  1020
TALFSNATTP MPGTTPASTI TPYHGSAGGP SMASTPGSAL TGSGPVGATP AGGAAGPVVA  1080
GASTAVPMEV EGGQQQQQQQ PSAAQQDVNN AAAAAQQGQQ PQPQGETGGN SAETVAGLSA  1140
RIETLETDIT DLERQLSELM AEVNQNRSEA SALEAALGSL QEQLLEDSSD EDADDDDISV  1200
SDHSSDLDGL DLDDGVGVGT AAGAGGQQHT AAAASTSDAA EAGQAAAAVI AAAAAAAAAA  1260
AAAALAASGR RTSPAPGMEA GFSGALAKEE GEAAAAAARA QDAEDDAAAA AASAPSNKPR  1320
GRPRSTAKMR AAEAAAAAAA AADSRRKQTE QVTRVPADGM LHLPAVHFKP RTLAAAISTV  1380
RQGHEEVLRM LPDDLAARVR DAIAAAAAKQ CRAAAAAPCP GYGRWAVQVP LVEVIVEPMY  1440
REPTDLPAYH ANQARHAAVR GAVARYLRQR RGLVAAKQEA LVAQYGTNMA AYKQYVTNNG  1500
RRPMLPIPMP TGRGHGLYGS RAAAATQPAY NPYAYSHSDV VRSDLDEQRL LNNFAAVETL  1560
KHISAVPDMV LDSWERRWRA YDNRCGLVLD PLRELEEERV LKAWAEEERT LFMDKFLAHP  1620
KDFRKIATFL PGRSPGDCVA FFYKNQKLDD FSTVRRKQQL KKRRLQADMR KQQYAPLLVA  1680
PIVAHRQRAA AQQAAGGPGG AAGVGPSGPG AGGGMGPGAD GRGGGRGGRG GARSMAAGSG  1740
MHHRRPSGQD PLEGYGPGAG PLPVPGSGPQ SMAHAVSAPL PVSAGRGPSG LPPGAAAVMA  1800
APGGYQTVMA AAAVHRGNGS DGRLDALGGS PPTGPPLLPP IGGSMTVQAP SPPHHGGGSS  1860
SGWTEDDFIE CYRTHGKNWE AYCRVLGMRS ESAAKQYYYR NRERLGLMEP PPQAAAPQRV  1920
PLLPSADVAA AAAAAAVAAA AAAGRGGREG ASEGASPQFD ELGLGGLQQP VLLPVKGASM  1980
RGRRGPTRTP TAGGGSLNNG PPLHSGAGSL ESSEALGRHY STKGDLDPEM RSDSEYDPAA  2040
AGAAAAAAAA AAGAAAGGDA ASAVDLLAAL RNPDSSALTQ LLAGGLLTAP PGKGAGGDQL  2100
GSMLGQLLLQ QEQRLEARER EQQQPSGGLG LNLNVLLPGG GNAAGGGSGL HLFTAGGSPA  2160
PMDEQLHGGA LPSPMDDLGG GGGGAGGLGG SGRRGANYWS QEEKDVFLHV FKSYGRDWAR  2220
LAEAIPTKST SQIKTFYHNY KTKMGLDKLE PPVGGPPTGR GRGGAAAAVR AGRESAAAAA  2280
ALGLRGVDLG DAGGGGGGGF GDADERPAKR QALTPTQELL AALTGAGGSG GADLSALTGG  2340
GGVDGGGGLS QLLDLQLLAG AFNGQGGSGG GGNGSSAASP ASILQLLSSL SELQGGPPSA  2400
GRDPGGDGGD GGLRLGPGAG GSASSTRGPS PGPGGNGGGS QALQALNLVS SRQGDGGLQG  2460
LALPSGIGLQ GVPLTGPGGL VSLFGGGGGG GGGNASGGSA TPLKLQALNM DLGTLAGLLG  2520
GSLDREAGAH GNGGGGDGGL GALAAHLQLQ HAFAGLDRER HAAAGGGGGG ALGGGLGGLG  2580
GIGGGGDSAA QVRELQAQLQ QRERERERER ELREMREFRE LERELREQQA EAEAAAAGAM  2640
SVTAAGLLEV FTSHMAQQRR QGQHPHPQQP EPLQHESSAA ANAAGGGGGG GGGTSTRAAL  2700
EALAAATAAA AAEAPDHRFA QATGEFGAAA AAAAANAGSG GPSAAALLAA LKGAMGELNG  2760
AGAGASGAAV TGVGRRSARS SHDGGLTTAE TLSHDGPGAG GGDAGDSEPA AKRQKRSSRD  2820
STQLQPLTLP GLDLGALVAA GGGGVAGGTP PAVSPRAGDV STRRLLLGPG GPSEDAVAQL  2880
LLQHQAGLRG AVPGPAGGGG GGGSGRPPIS PPLNLLGGPM AASSGLRLVP LAPSQAVGGG  2940
GPGRDEGLTR MRLSTPPLPL AVPGGGAGGS SSQALAEIIS SLKGGGALGG GGGSGGIERL  3000
LQADGPAGGL QPARGANGSG SGWQQTDME*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1490500RDRDRDRERER
Cis-element ? help Back to Top
SourceLink
PlantRegMapCre06.g264400.t1.1
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
TrEMBLA0A2K3DMY90.0A0A2K3DMY9_CHLRE; Uncharacterized protein
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP28831010
Representative plantOGRP32151725
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.12e-14MYB family protein