PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GSMUA_Achr2P00580_001
Organism Musa acuminata
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Zingiberales; Musaceae; Musa
Family Trihelix
Protein Properties Length: 701 aa    MW: 78745 Da    pI: 9.9608
Description Trihelix family protein
Gene Model
Gene Model ID            Type    Source  Coding Sequence
GSMUA_Achr2P00580_001    genome  CIRAD   View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  84.7   1.2e-26  55     153  1          87
               trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm..............rergferspkqCkekwenlnkrykkikegekkrtse 74 
                            rW+++e+laL+++r+em+ ++r +  k plWe+vsk++              +e g++rs+k+Ckek+en++k+yk++keg+ ++  +
  GSMUA_Achr2P00580_001  55 RWPRKETLALLKIRSEMDVAFRGATFKSPLWEDVSKVVtvslgsllhhlvklAEMGYKRSSKKCKEKFENVHKYYKRTKEGRAGQ--Q 140
                            8**********************************************************************************96..5 PP

               trihelix  75 ssstcpyfdqlea 87 
                            + + + +f+qlea
  GSMUA_Achr2P00580_001 141 DGKAYHFFSQLEA 153
                            5667******985 PP

2    trihelix  98.4   6.3e-31  367    452  1          87
               trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                            rW+k ev+aLi++ + +e+r++++  + plWee+s++m++ g++rs+k+Ckekwen+nk++kk+k+++k+r +e+s+tcpyf ql+a
  GSMUA_Achr2P00580_001 367 RWPKAEVHALIQLWTGLESRYQEAGPRVPLWEEISANMQRLGYSRSAKRCKEKWENINKYFKKVKDNSKQR-PEDSKTCPYFYQLDA 452
                            8********************************************************************97.99**********985 PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
SMART            SM00717   3.3       52     128  IPR001005    SANT/Myb domain
PROSITE profile  PS50090   5.738     54     126  IPR017877    Myb-like domain
CDD              cd12203   2.56E-23  54     133  No hit       No description
Pfam             PF13837   3.4E-16   54     154  No hit       No description
SMARTSM007170.05275426IPR001005SANT/Myb domain
PROSITE profile  PS50090   6.864     366    424  IPR017877    Myb-like domain
Pfam             PF13837   5.3E-23   366    453  No hit       No description
CDD              cd12203   5.58E-28  366    431  No hit       No description
Sequence
Protein Sequence    Length: 701 aa
MLRIPGGTDP LHPPAEAASP ISSLPPPDAT ANLDELTPAV AGNEEAEPSG VPGNRWPRKE  60
TLALLKIRSE MDVAFRGATF KSPLWEDVSK VVTVSLGSLL HHLVKLAEMG YKRSSKKCKE  120
KFENVHKYYK RTKEGRAGQQ DGKAYHFFSQ LEAIHNRSGG GAITLSAAVS QPTPASFTAG  180
VLGPHGFSSS AAAANGINFP WNSSSSSTES DEEDTEEAGE NQEGRKRKRS RSSRARRQLM  240
NFSEAIMKQV MERQEAMEQK FLEAIKKREH ERMIREEEWR RQEMALLSRE QELLAQERAV  300
AASRDTAVIS YLQKISGQSK PLPAATTTPR QAQTPQEQKP PPPVPISSET DQGILGSGSF  360
EPPSSSRWPK AEVHALIQLW TGLESRYQEA GPRVPLWEEI SANMQRLGYS RSAKRCKEKW  420
ENINKYFKKV KDNSKQRPED SKTCPYFYQL DAIYRKKLLG HGGRGGGDGD GRGSVGVQQQ  480
QEQDPKCSPV PQERADNVVQ MQHQRQTPSE AKYKNGNDSN RNGGNSKQAQ TSNGGHPPSF  540
FEEGMNKVRR AYPSSISFVM QRPRRPQRLQ VPSCLWFSSY CNLHQFSLLP LGCTTFSFLR  600
QHNSHVRVLA ANTPISVLLL PCPICIIKRD LMWIKNPSAH ISYKMPFQIE RGSHRLRKRE  660
VKEMVYRFTH TACMCQSSFL YSWPQYLAGG NMMEFGAFSN H
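For downstream use, the grouped and numbered sequence block above has to be collapsed into one contiguous string. The following is a minimal Python sketch, assuming the layout shown in this record (whitespace-separated groups of ten residues, with a running residue count at the end of each full line). The function name `parse_numbered_sequence` is hypothetical, and only the first two lines of the sequence are reproduced for brevity.

```python
def parse_numbered_sequence(block: str) -> str:
    """Collapse a grouped, numbered protein sequence block into one string.

    Assumes whitespace-separated residue groups and an optional trailing
    integer (the running residue count) on each line.
    """
    residues = []
    for line in block.splitlines():
        for token in line.split():
            if token.isdigit():
                # Running residue count at the end of the line, not sequence.
                continue
            residues.append(token.upper())
    return "".join(residues)


# First two lines of the protein sequence from this record.
EXAMPLE_BLOCK = """\
MLRIPGGTDP LHPPAEAASP ISSLPPPDAT ANLDELTPAV AGNEEAEPSG VPGNRWPRKE  60
TLALLKIRSE MDVAFRGATF KSPLWEDVSK VVTVSLGSLL HHLVKLAEMG YKRSSKKCKE  120
"""

sequence = parse_numbered_sequence(EXAMPLE_BLOCK)
print(len(sequence))
```

Applied to the full block, the resulting length can be compared with the stated length of 701 aa as a sanity check on the copy.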
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    111    119  KRSSKKCKE
2    226    235  RKRSRSSRAR
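A cross-check against the protein sequence listed in this record suggests that the NLS Start and End values behave as 0-based, end-inclusive offsets, whereas the Signature Domain Start and End values are 1-based. The Python sketch below illustrates this using the first 240 residues of the sequence. The helper names `slice_zero_based` and `slice_one_based` are hypothetical, and the coordinate interpretation is inferred from the data shown here, not from PlantTFDB documentation.

```python
# First 240 residues of the protein sequence from this record,
# with the grouping whitespace removed.
PREFIX = "".join("""
MLRIPGGTDP LHPPAEAASP ISSLPPPDAT ANLDELTPAV AGNEEAEPSG VPGNRWPRKE
TLALLKIRSE MDVAFRGATF KSPLWEDVSK VVTVSLGSLL HHLVKLAEMG YKRSSKKCKE
KFENVHKYYK RTKEGRAGQQ DGKAYHFFSQ LEAIHNRSGG GAITLSAAVS QPTPASFTAG
VLGPHGFSSS AAAANGINFP WNSSSSSTES DEEDTEEAGE NQEGRKRKRS RSSRARRQLM
""".split())


def slice_zero_based(seq: str, start: int, end: int) -> str:
    """Extract seq[start..end] with a 0-based start and an inclusive end."""
    return seq[start:end + 1]


def slice_one_based(seq: str, start: int, end: int) -> str:
    """Extract seq[start..end] with a 1-based start and an inclusive end."""
    return seq[start - 1:end]


# (Start, End, Sequence) rows from the NLS table.
NLS_ROWS = [(111, 119, "KRSSKKCKE"), (226, 235, "RKRSRSSRAR")]

for start, end, expected in NLS_ROWS:
    matches = slice_zero_based(PREFIX, start, end) == expected
    print(start, end, "matches 0-based reading:", matches)

# Signature domain 1 (Start 55, End 153) read as 1-based coordinates.
domain1 = slice_one_based(PREFIX, 55, 153)
print(domain1[:6], "...", domain1[-7:])
```

Under these assumptions, both NLS rows reproduce their listed sequences only with the 0-based reading, and the 1-based slice for domain 1 begins and ends with the same residues as the alignment above.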
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AC025296  7e-46    AC025296.10 Oryza sativa chromosome 10 BAC OSJNBa0076F20 genomic sequence, complete sequence.
GenBank  AP014966  7e-46    AP014966.1 Oryza sativa Japonica Group DNA, chromosome 10, cultivar: Nipponbare, complete sequence.
GenBank  CP012618  7e-46    CP012618.1 Oryza sativa Indica Group cultivar RP Bio-226 chromosome 10 sequence.
GenBank  CT830545  7e-46    CT830545.1 Oryza sativa (indica cultivar-group) cDNA clone:OSIGCFA218F06, full insert sequence.
Annotation -- Protein
Source  Hit ID                 E-value  Description
Refseq  XP_009380043.1         0.0      PREDICTED: trihelix transcription factor GT-2-like
TrEMBL  M0S3Z9                 0.0      M0S3Z9_MUSAM; Uncharacterized protein
STRING  GSMUA_Achr2P00580_001  0.0      (Musa acuminata)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MonocotsOGMP60638175
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76890.2  6e-60    Trihelix family protein