PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cre12.g514400.t1.2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas
Family MYB_related
Protein Properties Length: 3287 aa    MW: 314416 Da    pI: 7.5328
Description MYB_related family protein
Gene Model
Gene Model ID       Type    Source
Cre12.g514400.t1.2  genome  JGI
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  49.9   7.1e-16  50     94   1          47
                        TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
     Myb_DNA-binding  1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47
                        r rWT +E++++v+a k++G   W++I +++g ++t+ q++s+ qky
  Cre12.g514400.t1.2 50 RERWTDDEHQRFVEALKLYGRA-WRKIEEYVG-TKTAVQIRSHAQKY 94
                        78******************88.*********.*************9 PP
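
The alignment block above can be summarised programmatically. The following is a minimal sketch, assuming the consensus and target rows are copied by hand from the block; the function name `alignment_stats` and the dictionary keys are illustrative and are not part of PlantTFDB or HMMER. It counts identical columns (case-insensitive) and gaps in the target row.

```python
# Consensus (HMM) and target rows copied from the alignment block above.
HMM_CONSENSUS = "rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky"
TARGET = "RERWTDDEHQRFVEALKLYGRA-WRKIEEYVG-TKTAVQIRSHAQKY"


def alignment_stats(consensus: str, target: str) -> dict:
    """Column-by-column comparison of two aligned rows (hypothetical helper)."""
    if len(consensus) != len(target):
        raise ValueError("aligned rows must have equal length")
    # A column is identical when neither row has a gap character
    # and the residues match, ignoring case.
    identical = sum(
        1
        for c, t in zip(consensus, target)
        if t != "-" and c not in "-." and c.upper() == t.upper()
    )
    gaps = target.count("-")
    return {
        "columns": len(target),
        "identical": identical,
        "target_gaps": gaps,
        "target_residues": len(target) - gaps,
    }


stats = alignment_stats(HMM_CONSENSUS, TARGET)
print(stats)
```

The number of target residues returned here should agree with the span given by the start and end coordinates printed beside the target row (50 to 94).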

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SuperFamily      SSF46689          7.48E-16  44     100  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  7.2E-8    45     98   IPR009057    Homeodomain-like
PROSITE profile  PS51294           23.224    45     99   IPR017930    Myb domain
TIGRFAMs         TIGR01557         6.8E-17   48     97   IPR006447    Myb domain, plants
SMART            SM00717           4.4E-13   49     97   IPR001005    SANT/Myb domain
Pfam             PF00249           3.7E-13   50     94   IPR001005    SANT/Myb domain
CDD              cd00167           1.92E-8   52     95   No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0009409  Biological Process  response to cold
GO:0009651  Biological Process  response to salt stress
GO:0009723  Biological Process  response to ethylene
GO:0009733  Biological Process  response to auxin
GO:0009737  Biological Process  response to abscisic acid
GO:0009739  Biological Process  response to gibberellin
GO:0009751  Biological Process  response to salicylic acid
GO:0009753  Biological Process  response to jasmonic acid
GO:0010243  Biological Process  response to organonitrogen compound
GO:0042754  Biological Process  negative regulation of circadian rhythm
GO:0043496  Biological Process  regulation of protein homodimerization activity
GO:0045892  Biological Process  negative regulation of transcription, DNA-templated
GO:0045893  Biological Process  positive regulation of transcription, DNA-templated
GO:0046686  Biological Process  response to cadmium ion
GO:0048574  Biological Process  long-day photoperiodism, flowering
GO:0005634  Cellular Component  nucleus
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 3287 aa
MAHQFNGPGG PGSSVSPPPQ LKQEAVPAAQ NAAEPPPSKT RKQYTVSKRR ERWTDDEHQR  60
FVEALKLYGR AWRKIEEYVG TKTAVQIRSH AQKYFNKLEK AGDAAEIQVP PPRPKRKSVT  120
AAQRAAAAAA AAEQQLNPRS GSEERGYGDL QDDNDAGSEG GAEDGEPGYG QHDQMGSPDD  180
VPLAADLQGL LLQPPRNGAA AAAAAAAAAA AAGDLGPLAA AAGLPQLPVL QQQQQEQSAV  240
MQQQLQELQQ LQLQPFTAPG LMPFRGAAGA AAVQLPVLAA AAAAGQLLAA AAAGALALPP  300
APSPPPVLLG GALPPVGVGA LGQQPQQQPQ QATPSGTTLS AQAPMQMQAS PQQQQSQQQQ  360
QQQAPAGSAG LSAGAGAPGA SNGGFSLADS IARNIAINMA QWAQAQVQVP DAAPAPAGAP  420
DRQTVEAVAA AAAAAAAAAA TAVISAAGEA IQQRIQEKAA AGYLPFLVHP LAELIPPSLA  480
ARSGSAATTS DQSAPPWALP VPQQSLMHVL AGSVSTGGGA AGTHPHSAPP VSSRQQQQQQ  540
QQQQQQQQQQ QQQQQQQQQQ QTAPFSLGAA SGGAAGNAEQ QLTATRDDTH TGQHTGGTGG  600
TVTATPGSSA QQQGGAVSGP GNSPQHMSAS FAAVAAGGGS GGVASVSMAG GLGRLPSSSL  660
VASLGGPSVS LGGAAAMSLG VDLQPPQLLA PVASPPRMEW EVAVTVARAS QQQLQLQLQQ  720
QQQQQQQRQT APGFGLLAPQ PMAREGSAAA GLLRTLSLPA AWDSGGAAAQ AAAALQSREV  780
LSRLAAAYTS DVPQLQAVVA ATAAQQQGLP QGLLQPQPQL LQLVEQLPSR PSVELAQSPV  840
TLPGGGGSSS AFQVLRPATS QQQLEVSLDP RVRAVVDPRL QPQQQPRHVR SQSAAPGGAQ  900
PLPQARAEAQ GQQEAPAQAQ AQAQAQQEAQ AQAQAQAPYI KAEVLTEDEE RGRERERDPH  960
RHHHHHHHHA HHHHHHHSHH GSNHGGTYGG AHSRGIGSSA EPERGSRNPV DRRSRSVASI  1020
RSGGGCEERQ MHSGSGGGAG GNGLTRIGAA AAAAAAAVAA AAEEAAAQEQ VDKSSGASGC  1080
DNGSPAVSPD PRQLPLAVRG HGISPSAGAE GQLAHAGGGG TTGARVRFEA HVRSSTSEGR  1140
AHTGQQPQAT AAAAVAAAAV AGAGHRATAH RHGVHSSHRH RSSGGRGSAA AAAAATAAAS  1200
GAMDTDASAE PGGGARHRAG SRAAWLMDDY GAAASGDGAY QLGQLQYLPQ QFMPQQSGPD  1260
NGGLSGTTGS AFNSLQEVLQ GPDPAAMVPL LAPTSGVGAC GVQDAATTAQ LARLQLGAAH  1320
RSQAAERRVA AAERAAAAER AADGEGQEGG SGGNGVTDVN TYIRGDYSLL SPSGGNGGSG  1380
GYAAQLAGAV ATAAGSGGGG GSGGGSGGGG VPLSSSGPSS VPSTQQQGAN GGAATAMAAH  1440
RHSAQQTKQR AAAAAAAAAG AGGSDGNGAG ASNEMQQGSN QNAGSGAGSG QAGEAAGGVA  1500
QATGVALALP PPLPRSLLHV AARQGAPEAP SAERADAVGQ GACGGAAGLN AVSPTAGQPG  1560
GGSNTGSGSG DGSGAGGPPD VYQSSGPTGL ARAAAHAAGL RQETPPTGVS AGTRRGAGGS  1620
GSGNADGGGG ATGSGSGAGQ GSNPSGNGAG EGSGSGAAPA AGTAAGAANT GGVGSTQAAA  1680
GPHALQHPHA HGNAHQQQLQ QVQLQQQQQQ QQQGSGSGGE RGGSGSGAGS GSRVGSGVPA  1740
GTYAAREAMP GASGIGRHVP PYVVAAQRAQ HERVEAAAAK AAAATVGRDG EAGQADGSKP  1800
ISSPEADAEG ALLGSGGGSG GGAVVDLAVA LRHKQRGSPG GSGGMMGSQA RGDSPAAALT  1860
DPRLLQRGGS ALALAAGAAG RVASPPPNGQ STLPVPGDTL ALLHQMLSQQ QQQLQQAQAQ  1920
VVQQQQAQAA AAAAAGQDGG AAAVGSLAPM APPPHGLMTT LAGLSAGTQL AATLLALHEQ  1980
QQQQQQQEQL LATNRGGGSA AAGAGRSNGF LSLAAAAADP TSSGGGSSAT APPPLMALGP  2040
GALTALLPYL TGERPVESEH VARLEEVARA LQYASSLAGP LAAVLHPPGT SGGGAVDAPP  2100
TDPRRPDLAA LFSSGGGGGG SSTTSRLTAD VLDVMRGAGP SRLLGPPAAQ LASARDGLPS  2160
SDLAAAALMA QWQNAATMYG LQPLLRTSSA AAAAAAAPGS ATSGGSAAAP PPPSSAAAAA  2220
ALLGFPPHPL LIQQHHLQQQ LAASGQAQAA ASYLQSLGLA SGLGLGFNMA ANMGSRSALG  2280
ALSLGPMGLS LPTFASGGGG GNGTASGGAL GAAAAAGGGG GAEQSPRPSA GTIGSGGGVH  2340
GLMPAPTRSH HQQHSHQQHQ QAAGAPSDPR LSGGNPRRGS GDGAVGFKPP RPLASLGGVD  2400
GSGGVARGLG SGPSMSGEKL AAAAAAAAAA GGNGGTSKRS LNELYGALVA AGAGAGGAAG  2460
GGGGGSGQGV QRSGRQRNGR RRHHRGHSPT RSVGSGASSS GFHCGDKSGG EEANYGPRPS  2520
REYKEGSPQG GGAVEQQDTA TSSGREEQGS AGAARGCGAA GEVGTAGQSG GGAGGGAAAA  2580
NAAAAANAAA DGGGGMALPP AKHARHDLMT GPNGKLAAAA GHGGGAGGAS GQGTGGAAAK  2640
AAAQRPPKDA TPSSGGGSGG SGGGSTGGNK SGTSNDAYDG SMNASDEGSN EPGSAPADGP  2700
MGQALAPSDG MGMLGAGDPG LNVQGMQMPG AGMMGAGGAR SFSVAQPEPG PGHASIMAAQ  2760
PGGCGGGGST GVAGAGAGAG GGAAGGGGGT AGHKRVRYDA GPAGGGTAAA AGAPAAGAPL  2820
TPAAAGAAGA MQQRAGALQA DGAGTGTSSP SETGGGDDSR GAAQPPHKVQ RTNAGGAAAV  2880
AEAGAGGAAG HAADACDGNN RSGGSAGAGS GQAQSGRGRF AAPPSNQQHP HGAAAAAAAA  2940
GGAGGGAAQR TSGSLPSHLG YVLEQASSKQ GGNDGSGNNN SGTGPRSGVG SNEGGAAGGA  3000
GGSRSGEGGN GNGSQGNGGS HGHGGSHGNG QGSNANGSHG HGSNGNNGHG SNGNGASTNL  3060
PGGAPYAYFR TPLSLQQQPR GGGGGGGGGG GGGAGSGGSG GSGGNRGGGA QGSNGGNGDG  3120
AGSGGHGSGA HDGASGGAAG PSPYDAGAPT SGAAADHMHR HLHHHHFTHA HHHYGGGGGG  3180
GPQEGTSMRT PLPLQPQQPQ PQPQQQQQQQ QQQQQQAGTG AANAGQGMAL AAVAAAAAAA  3240
PAVAGTSGGA AVVAAAAAAA AAVSQQQPVH VEGQGQDGNE PMLQDG*
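
To work with the sequence listing above, the block spacing and the running position counters have to be stripped first. The sketch below assumes the listing is pasted as plain text; the helper names `parse_numbered_block` and `slice_1based` are hypothetical, and only the first two lines of the listing are used as sample input. It then extracts the Myb_DNA-binding region using the 1-based, inclusive coordinates 50 to 94 reported for the signature domain.

```python
import re

# First two lines of the protein sequence listing, copied verbatim.
LISTING = """\
MAHQFNGPGG PGSSVSPPPQ LKQEAVPAAQ NAAEPPPSKT RKQYTVSKRR ERWTDDEHQR  60
FVEALKLYGR AWRKIEEYVG TKTAVQIRSH AQKYFNKLEK AGDAAEIQVP PPRPKRKSVT  120
"""


def parse_numbered_block(text: str) -> str:
    """Drop spaces, position counters and a trailing '*' (stop marker)."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both 1-based and inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


sequence = parse_numbered_block(LISTING)
myb_domain = slice_1based(sequence, 50, 94)
print(len(sequence), myb_domain)
```

With the full listing as input, the same two helpers can be applied to any other start and end pair reported on this page, provided the coordinates use the same 1-based convention.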
Nuclear Localization Signal (NLS)
No.  Start  End   Sequence
1    1395   1405  SGGGGGSGGGS
Binding Motif
Motif ID  Method  Source
MP00103   PBM     Transfer from AT2G46830
Cis-element
Source       Link
PlantRegMap  Cre12.g514400.t1.2
Annotation -- Protein
Source  Hit ID      E-value  Description
TrEMBL  A0A2K3D3P8  0.0      A0A2K3D3P8_CHLRE; Uncharacterized protein
Orthologous Group
Lineage               Orthologous Group ID  Taxa Number  Gene Number
Chlorophytae          OGCP4541427
Representative plant  OGRP12551549
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G02840.2  9e-30    LHY/CCA1-like 1