PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Zmw_sc03489.1.g00010.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Chloridoideae; Zoysieae; Zoysiinae; Zoysia
Family Trihelix
Protein Properties Length: 1393aa    MW: 151098 Da    PI: 9.1517
Description Trihelix family protein
Gene Model
Gene Model ID           Type    Source  Coding Sequence
Zmw_sc03489.1.g00010.1  genome  ZGD     -
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  96.4   2.5e-30  436    520  1          87
                      trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpy 81 
                                   rW+++e+laLi++r+em+ ++r++ lk+plWe+vs+k+++ g++rs+k+Ckek+en++k+yk++keg+ +r++++s  +++
  Zmw_sc03489.1.g00010.1.am.mk 436 RWPREETLALIRIRSEMDATFRDATLKGPLWEDVSRKLADLGYKRSAKKCKEKFENVHKYYKRTKEGRAGRQDGKS--YRF 514
                                   8*********************************************************************866665..*** PP

                      trihelix  82 fdqlea 87 
                                   f++lea
  Zmw_sc03489.1.g00010.1.am.mk 515 FSELEA 520
                                   ***985 PP

2    trihelix  108.8  3.6e-34  815    900  1          87
                      trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpy 81 
                                   rW+k ev+aLi++r +m+ r++++  k+plWe++s+ mr+ g++rs+k+Ckekwen+nk+ykk+ke++kkr +e+s+tcpy
  Zmw_sc03489.1.g00010.1.am.mk 815 RWPKAEVHALIQLRMDMDMRYQETGPKGPLWEDISAGMRRLGYNRSSKRCKEKWENINKYYKKVKESNKKR-PEDSKTCPY 894
                                   8*********************************************************************8.99******* PP

                      trihelix  82 fdqlea 87 
                                   f+qlea
  Zmw_sc03489.1.g00010.1.am.mk 895 FHQLEA 900
                                   ****85 PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
SMART            SM00717   3.3E-4    433    495  IPR001005    SANT/Myb domain
CDD              cd12203   1.62E-30  435    500  No hit       No description
Pfam             PF13837   3.4E-21   435    521  No hit       No description
PROSITE profile  PS50090   7.039     435    493  IPR017877    Myb-like domain
SMART            SM00717   0.0038    812    874  IPR001005    SANT/Myb domain
Pfam             PF13837   2.5E-23   814    901  No hit       No description
PROSITE profile  PS50090   7.189     814    872  IPR017877    Myb-like domain
CDD              cd12203   4.72E-31  814    879  No hit       No description
Sequence
Protein Sequence    Length: 1393 aa
MEKEILFLQY LQDSKAADAV TTGEIDRWID GQQRQRHAGR GGTAGAPHAM HAARDCVSRL  60
QRGRRRCQRS PVVRARAPHL GAAQGYGSVS RRVQSIHPKD PSIHPQPPRL INFPYLSTTS  120
PGYKTGAASA VTDSSGVDRR VALSLTLSLG QAPSHHHQVT SWARGVFACF FLPSSLALDP  180
FQLSTLPPIF FTISLSLSLS LSLPSHPIPS PPCPPAGRSL PFPYFFHSSP FSLSPRPHHT  240
HSVCPWEGEE EGAGHRAARL ELPLLSYNSV LRLLLAAFLP SLYSSSYKYA GTQGATRPAI  300
QGLAFGCGCY PQAASSSKNR RVAAAGKEKL KFNREEEELV AAPMQHQPQH QGSESPYGVA  360
PPDMGPFSPP AASSAMPLSS RPPSSSQEQQ QLTLSYEELT AVSGAGAGAG ASFPDDEMLG  420
GGSGGGGGSG TPGGNRWPRE ETLALIRIRS EMDATFRDAT LKGPLWEDVS RKLADLGYKR  480
SAKKCKEKFE NVHKYYKRTK EGRAGRQDGK SYRFFSELEA LHAAAPQQPA SSAPQMHAFA  540
APVSAPPPMN PLPPPAAGGP MQPAPISSAA PAPFELPPTQ PLNLQGLSFS SMSDSESDGE  600
SDDDDMTAET GGGQEQLGKR KRGGGGGKKM MSFFEGLMQQ VVERQEEMQR RFLETMEKRE  660
AERTAREEAW RRQEVARLNR EQDQLAQERA AAASRDATII AFLQRIGGHS VQPPAAVIVP  720
MSAHMTVQTP PPLPKQPPRQ QPTQATPPPK PISASPLQQQ PPQQQYKETS PQNVSTPRGA  780
PPTPGSAASL ELVPTSEQHV DLALGGEGGA ALSSRWPKAE VHALIQLRMD MDMRYQETGP  840
KGPLWEDISA GMRRLGYNRS SKRCKEKWEN INKYYKKVKE SNKKRPEDSK TCPYFHQLEA  900
IYLKKQQSGG TAASSANVVA AATTPAFSSQ LNQSRQEIEG KNINDDKRNN GGSSSGGAQV  960
PPSNGDPAPP LTAALDVDSG TKKPEDIVRE LTERPPREVM TDETDSDDMG DEYTDDGEEG  1020
EDDSKMQYRI QFQSPNSGGN NSAPAPAPST TTTPPGPASA PTTTNTFAAM VQPRTTKRPA  1080
DVADTDDGSV IKKWRRGSSS STTGEKRTDR STGEQGKMGS LGVSRDWIGS EGIISARKVT  1140
LPVAHLARLT PGQLTPHMTM RDLLLRDRMI RPSLSNHERS LARCLGCLQS SSVMVSAPAP  1200
TIIPAGGVVN LTTSSGATNA IRGTWPPYVM ASSGRRYVVT GRWSVAHGEL VNSDDEVLRL  1260
PGTYQQGATP RRHAGKAEFR PQLAASRRDL ILLRVPFGGS EDEDEYFIYR AGGEKGSSLR  1320
LLPRPARTTT FCDEDVGRLR RGEEHYTFRD EDVGLLRRSE EHYTVAALLP SGKSNVYQLH  1380
RFDSVTEKWS TDK
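
The coordinates in the Signature Domain table can be checked against the sequence block. The following is a minimal sketch, not part of the database record: it assumes the Start and End columns are 1-based and inclusive, and the helper names clean_sequence and slice_region are hypothetical. To stay short it uses only the two sequence lines covering residues 421-540, which contain the first trihelix domain (436-520).

```python
import re

# Excerpt of the protein sequence block (residues 421-540), copied with its
# 10-residue spacing and trailing position counters intact.
EXCERPT = """
GGSGGGGGSG TPGGNRWPRE ETLALIRIRS EMDATFRDAT LKGPLWEDVS RKLADLGYKR  480
SAKKCKEKFE NVHKYYKRTK EGRAGRQDGK SYRFFSELEA LHAAAPQQPA SSAPQMHAFA  540
"""
EXCERPT_START = 421  # 1-based position of the first residue in EXCERPT


def clean_sequence(block: str) -> str:
    """Drop whitespace and position counters, leaving only residue letters."""
    return re.sub(r"[\s\d]", "", block)


def slice_region(seq: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (assumed 1-based, inclusive).

    first_pos is the 1-based position of seq[0] in the full protein, so the
    same function works on an excerpt as well as on the complete sequence.
    """
    if start < first_pos or end < start or end - first_pos >= len(seq):
        raise ValueError("region lies outside the supplied sequence")
    return seq[start - first_pos : end - first_pos + 1]


residues = clean_sequence(EXCERPT)
domain1 = slice_region(residues, 436, 520, first_pos=EXCERPT_START)
print(len(residues), len(domain1))
print(domain1)
```

Under these assumptions the extracted region begins with RWPREETLALI and ends with FSELEA, which agrees with the ungapped target line of the first trihelix alignment. Passing the whole cleaned sequence with first_pos=1 would let the second domain (815-900) be checked the same way.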
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    478    486  KRSAKKCKE
Annotation -- Protein
Source  Hit ID             E-value  Description
Refseq  NP_001344790.1     0.0      uncharacterized LOC100279868 isoform 2
TrEMBL  A0A1D6E4F1         0.0      A0A1D6E4F1_MAIZE; GT-2-like 1
STRING  GRMZM2G016649_P03  0.0      (Zea mays)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MonocotsOGMP60638175