PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Zjn_sc00004.1.g05900.1.am.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Chloridoideae; Zoysieae; Zoysiinae; Zoysia
Family Trihelix
Protein Properties Length: 1298 aa    MW: 139276 Da    pI: 7.1212
Description Trihelix family protein
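The record does not state how the MW value was computed. As a rough cross-check, the following is a minimal sketch of the conventional estimate, assuming average residue masses plus one water molecule for the free termini. The function name `estimate_molecular_weight` is hypothetical and is not part of PlantTFDB; the result may differ slightly from the reported value depending on the mass table a given tool uses.

```python
# Average residue masses in daltons (free amino acid minus one water).
# Assumption: standard average-isotope values; PlantTFDB's own table is unknown.
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water is added back for the N- and C-termini


def estimate_molecular_weight(sequence: str) -> float:
    """Estimate the average molecular weight (Da) of an unmodified peptide."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence is empty")
    unknown = set(seq) - set(AVERAGE_RESIDUE_MASS)
    if unknown:
        raise ValueError(f"unsupported residue codes: {sorted(unknown)}")
    return sum(AVERAGE_RESIDUE_MASS[residue] for residue in seq) + WATER_MASS
```

Passing the full 1298-residue protein sequence from this record (with spaces and position counters removed) gives an estimate that can be compared against the reported MW.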
Gene Model
Gene Model ID                 Type    Source
Zjn_sc00004.1.g05900.1.am.mk  genome  ZGD
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  96.6   2.3e-30  287    371  1          87
                      trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpy 81 
                                   rW+++e+laLi++r+em+ ++r++ lk+plWe+vs+k+++ g++rs+k+Ckek+en++k+yk++keg+ +r++++s  +++
  Zjn_sc00004.1.g05900.1.am.mk 287 RWPREETLALIRIRSEMDATFRDATLKGPLWEDVSRKLADLGYKRSAKKCKEKFENVHKYYKRTKEGRAGRQDGKS--YRF 365
                                   8*********************************************************************866665..*** PP

                      trihelix  82 fdqlea 87 
                                   f++lea
  Zjn_sc00004.1.g05900.1.am.mk 366 FSELEA 371
                                   ***985 PP

2    trihelix  108.9  3.3e-34  665    750  1          87
                      trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpy 81 
                                   rW+k ev+aLi++r +m+ r++++  k+plWe++s+ mr+ g++rs+k+Ckekwen+nk+ykk+ke++kkr +e+s+tcpy
  Zjn_sc00004.1.g05900.1.am.mk 665 RWPKAEVHALIQLRMDMDMRYQETGPKGPLWEDISAGMRRLGYNRSSKRCKEKWENINKYYKKVKESNKKR-PEDSKTCPY 744
                                   8*********************************************************************8.99******* PP

                      trihelix  82 fdqlea 87 
                                   f+qlea
  Zjn_sc00004.1.g05900.1.am.mk 745 FHQLEA 750
                                   ****85 PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
SMART            SM00717   3.3E-4    284    346  IPR001005    SANT/Myb domain
CDD              cd12203   1.45E-30  286    351  No hit       No description
Pfam             PF13837   3.1E-21   286    372  No hit       No description
PROSITE profile  PS50090   7.039     286    344  IPR017877    Myb-like domain
PROSITE profile  PS50090   7.34      658    722  IPR017877    Myb-like domain
SMART            SM00717   0.0011    662    724  IPR001005    SANT/Myb domain
CDD              cd12203   4.20E-31  664    729  No hit       No description
Pfam             PF13837   2.3E-23   664    751  No hit       No description
Sequence
Protein Sequence    Length: 1298 aa
MYTAGLLQAL SLTLSLGQAP SHHHQVTSGA RGVFACFFLP SSLALDPFQL STLPPIFFTI  60
SLSLSLSLSL SLSLSLYHPT PSLPRRALRP AGPFPSPTCT SPSFPISSSP SSFHSSPFSL  120
SLLAHTTLSL GASVGGRGGG SWAPGATRPA IQGLAFGCGC YPHAASSSKN RLVAAAGKEK  180
LKFNREEGEL VAAPMQHQPQ HQGSESPYGA APPDMGPFSP PAASSAMPLS SRPPSSSQEQ  240
QQLTPSYEEL TAVSEAGAGA GASFPDDEML GGGSGGGGGS GTPGGNRWPR EETLALIRIR  300
SEMDATFRDA TLKGPLWEDV SRKLADLGYK RSAKKCKEKF ENVHKYYKRT KEGRAGRQDG  360
KSYRFFSELE ALHAAAPQPP ASSAPQMHAF AAPVSAAPPM NPLPPPAAGG PMQPAPISSA  420
APAPFELPPP QPLNLQGLSF SSMSDSESDG ESDDDDMTAE TGGGQEQLGK RKRGGGGKKM  480
MSFFEGLMQQ VVERQEEVQR RFLETMEKRE AERTAREEAW RRQEVARLNR EQDQLAQERA  540
AAASRDATII AFLQRIGGHS VQPPAAVIVP MSAHMTVQTP PPPPKQPPRQ QPTQATPPPK  600
PISASPLQQQ PPQQQYKETS PQNVSTPRGA PPTPGSAASL ELVPTSEQHV DLALGGEGGE  660
ASSSRWPKAE VHALIQLRMD MDMRYQETGP KGPLWEDISA GMRRLGYNRS SKRCKEKWEN  720
INKYYKKVKE SNKKRPEDSK TCPYFHQLEA IYLKKQQSGG TAASSANVVA AATTPAFTSQ  780
LNQSRQEIEG KNINDDKRNN GGSSGGGAQV PPSNGDPAPP LTAALDVDSG TKKPEDIVRE  840
LTERPPREVM TDETDSDDMG DEYTDDGEEG EDDGKMQYRI QFQSPNSGGN NSAPAPAPAP  900
STTTTPPGPA SAPTTTNTFA AMVQPRTTKR PADVADTDDG SGKMGSLGVS RDWIGSEGII  960
SARKVTLPVA HLARLTPGQL TPHMTMRDLL LRDPREGADT HSALQRMIRP SLSNHERSLA  1020
RCLGCLQSSS VMVSAPAPTI IPVGGVVNLT TSSGATNASV CHASTPRGES GVQAAACRVR  1080
TCSQLSTPRR PSPFLSPGCA TIITPPTRGI LMSDYILLHA DAFRGIITNW DSTAARATTG  1140
HNLTVIASLR CPECPFHSTI LFANVPGIDF TDEHPRIVRA VEDLILLRVP FGGSEDEDEY  1200
FIYRAGGEKG SSLRLLPRPA RTTTFCDEDV GRLRRGEEHY TFRDEDVGLL RRGEEHYTVA  1260
ALLPLGKSNV YELHRFDSVT EKWSTDKVPL VEPQFSFP
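The sequence is displayed in groups of ten residues with a running position counter at the end of each full line. To work with the domain coordinates listed in this record, the block first has to be joined into one contiguous string. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as the domain alignment numbering suggests). The helper names `parse_sequence_block` and `slice_region` are hypothetical, and only the first two lines of the sequence are used as sample input.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a grouped sequence block into one contiguous string.

    Spaces, line breaks and trailing position counters are dropped;
    only alphabetic residue codes are kept.
    """
    return "".join(re.findall(r"[A-Za-z]+", block)).upper()


def slice_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


# Sample input: the first two lines of the sequence block in this record.
example_block = """
MYTAGLLQAL SLTLSLGQAP SHHHQVTSGA RGVFACFFLP SSLALDPFQL STLPPIFFTI  60
SLSLSLSLSL SLSLSLYHPT PSLPRRALRP AGPFPSPTCT SPSFPISSSP SSFHSSPFSL  120
"""
example_sequence = parse_sequence_block(example_block)
first_ten = slice_region(example_sequence, 1, 10)
```

With the complete block as input, `slice_region(full_sequence, 287, 371)` would extract the first trihelix domain as reported in the Signature Domain table.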
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    329    337  KRSAKKCKE
Annotation -- Protein
Source  Hit ID             E-value  Description
Refseq  NP_001344790.1     0.0      uncharacterized LOC100279868 isoform 2
TrEMBL  A0A1D6E4F1         0.0      A0A1D6E4F1_MAIZE; GT-2-like 1
STRING  GRMZM2G016649_P03  0.0      (Zea mays)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Monocots  OGMP60638175
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G33240.1  2e-39    GT-2-like 1