PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cc02_g11460
Common NameGSCOC_T00029409001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Gentianales; Rubiaceae; Ixoroideae; Coffeeae; Coffea
Family MYB
Protein Properties Length: 1731aa    MW: 188323 Da    PI: 6.4275
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cc02_g11460genomeCGSCView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding28.43.7e-09840881346
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT eE e + d  + +G++ +++Ia+ +  ++t  +c+++++k
      Cc02_g11460 840 PWTSEEKEMFMDMLAVHGKD-FTKIASFLV-HKTTADCVEFYYK 881
                      8*****************99.*********.***********98 PP

2Myb_DNA-binding33.41.1e-1010591099345
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                        WT eE  ++++av  +G++ ++ I+r++  +R+++qck ++ 
      Cc02_g11460 1059 DWTDEEKAIFIQAVSSYGKD-FAMISRYVS-TRSREQCKVFFS 1099
                       5*****************99.*********.********8876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.94E-13824885IPR009057Homeodomain-like
PROSITE profilePS5129314.257836887IPR017884SANT domain
SMARTSM007176.3E-9837885IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.605.6E-6839884IPR009057Homeodomain-like
PfamPF002491.5E-6839881IPR001005SANT/Myb domain
PROSITE profilePS5129312.10310551106IPR017884SANT domain
SMARTSM007178.5E-1010561104IPR001005SANT/Myb domain
SuperFamilySSF466894.45E-1110571106IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.606.9E-710591100IPR009057Homeodomain-like
PfamPF002491.6E-810591099IPR001005SANT/Myb domain
CDDcd001675.19E-810601098No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1731 aa     Download sequence    Send to blast
MPPEPLPWDR KDFFKERKHE RQEPYHHHHH HHHLHPTGGG GGGGGGGYGG GIGGFGGGPR  60
WREPPHPHPH PYHYASPRWV SDFRYRPPPG YGKQGGRHLY PEESSHGFVP SRPSDRVFED  120
ENCRASVSGK YSRSNRESRG PFGQKDWKGQ SWEATPSPNA PGRPLETSDQ HRSVDEMQTC  180
TSSHPHLDSA NSWDQSHLKD QHEKSSGVVN ALGSSGQRLE RENSLGSMDW KPLKWTRSGS  240
LSSRGSGFSH SSSSKSMGAD SNEMKAEVQP SNVTPVQSPS GNAATPVAAP AAAYETSAGA  300
SEEMSSRKKP RLGWGEGLAK YEKKKVEGVD DTTLKNGTII CSSSREPLHL HSSHLADKSP  360
RITAFSDCAS PATPSSVGCS SSPGLEEKQF IKAPSVDNEA TNLSPSIVSQ DHRDHIEGAT  420
FDLENLDLAE SGHFNSAINE LLLSDDLISV DSGFVKSTAI NKLLVWKGDV LKKLEMTESE  480
IDRLEGELKT LASIPESSCH HPAVSSSLPM DCFSKPAEEQ DVTSSISHRP ALLDLGSSGH  540
NDAEKMPNVL VDDHTEVKDE DVDSPGSATS KFVEVVSSGK DASPSELGNE PGIDSVCISN  600
TDCAMSKNLE LRYVGNGVHE DNGGENFQLV ASCSPTHLDE ISLCDDKELK LCESIFASNK  660
ESASRAAEVF NKLLPADLCK FDISGVCSLK SNPMVKENFL RRKRFQQFKE RCIALKYRAL  720
QHLWKADVCS LSMRRFRVKS HKKLDLSLRT VLNSSQKHRS SFRSRLSSHD GNVSSGSNTV  780
MMNFISKLLS DSQVKPCRDT LKMPAMILDK KEKMISRFIS SNGLVEDPSA VEKERSMINP  840
WTSEEKEMFM DMLAVHGKDF TKIASFLVHK TTADCVEFYY KNHKSDCFKK TKKHPEYPKQ  900
GKSYTANNYL VASGKRWHCE ANAASLDILG AASAIAANVD HGMEIQQTPT SKYLLGRSSD  960
YKSSKGDNGL LERPSSLDAD NNERETVAAD VLAGICGSLS SEAMSSCITS AVDPGEGYRE  1020
WKYSRVGSSS RLPLTPEAMQ NGDEETCSDE SCGEMDPTDW TDEEKAIFIQ AVSSYGKDFA  1080
MISRYVSTRS REQCKVFFSK ARKCLGLDMI SPGPGNVVRR DASGGSDTDD VGVVETGSIT  1140
CSEKSGVKLE VDLPCPEVKL NIEPDSAGLA NVNPDLNRLE EISGTGDRAA VEAGLQSKNL  1200
TDDSQMEEKP EQEADGSGDI QSVPSGEVEQ GTAVTTTGVG DTSDSANTLD TQIHSGALEK  1260
RDEHLDAEME GLSPVSWESS INDRKEKDDA NQKDVNGMDQ DLKSTPHGDI SGDRQIGVLE  1320
TDSAGKPCVG PIEQNGFPAP MKSVPQSCAV KCQTPNEATL SALEVVKISG EQGHQVTRVG  1380
EKLRSGSSLL GSVDPCHILK GYPLPPSTTR EVNGNSSCRR SATPQSIPKL GNNFHRDCHL  1440
ARDSYLQKCN GVKHYSSIAE LPFKFREQSR DTNPDHQSGS LSDVEKPRRN GDVKLFGQIL  1500
TKPSYQPKSS SSRQQNGGNE NQQSKIGKPL GTKFASDQAI GGNLSQTKLD RNNLLGTENL  1560
PVRSFGYWDG SRIQTGLHSL PDSAMLLAKY PAAFGNYVLP SSKLEQLPVH GVNNGERNLN  1620
GSAVFPAREI GSSNAAAAAA ADYQAYRSRE LQPFTLDMKQ RQDAVLSEMH RRNGFDVVSG  1680
MQQAARGLVG INVVTAIKMH YSKAEQLNGG QTASIIREDD SWRGKGSIGR *
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C1e-16798889494NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D1e-16798889494NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_027155746.10.0uncharacterized protein LOC113756173
TrEMBLA0A068TU320.0A0A068TU32_COFCA; Uncharacterized protein
STRINGVIT_13s0019g04010.t010.0(Vitis vinifera)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA59042332
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.11e-108MYB family protein