PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GSMUA_Achr4P24780_001
Organism Musa acuminata
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Zingiberales; Musaceae; Musa
Family Trihelix
Protein Properties Length: 669 aa    MW: 74002.6 Da    pI: 9.1476
Description Trihelix family protein
Gene Model
Gene Model ID          Type    Source  Coding Sequence
GSMUA_Achr4P24780_001  genome  CIRAD   View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  33.1   1.4e-10  98     139  1          42
               trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrerg 42 
                            rW++qe+laL+++r+em+ ++r++  k+ lWeev ++ +  +
  GSMUA_Achr4P24780_001  98 RWPRQETLALLKIRSEMDAAFRDATFKGSLWEEVCRYSKPSQ 139
                            8*********************************99876555 PP

2    trihelix  44.8   3.2e-14  198    246  37         87
               trihelix  37 kmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                            k+ e g++rs+k+Ckek+en++k+yk++keg+ +r++++s  +++f+qlea
  GSMUA_Achr4P24780_001 198 KLGELGYKRSAKKCKEKFENVHKYYKRTKEGRAGRQDGKS--YRFFSQLEA 246
                            56799*****************************866665..*******85 PP

3    trihelix  107.3  1.1e-33  392    477  1          87
               trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                            rW+k ev+aLi +r+ +e++++++  k++lWee+s+ m++ g++rs+k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ql+a
  GSMUA_Achr4P24780_001 392 RWPKAEVHALISLRSGLESKYQEAGPKGTLWEEISAGMQRLGYNRSAKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFHQLDA 477
                            8*********************************************************************8.99***********85 PP

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
SMART            SM00717   22        95     221  IPR001005    SANT/Myb domain
CDD              cd12203   3.50E-16  97     226  No hit       No description
Pfam             PF13837   3.5E-9    198    247  No hit       No description
PROSITE profile  PS50090   7.294     385    449  IPR017877    Myb-like domain
SMART            SM00717   0.12      389    451  IPR001005    SANT/Myb domain
CDD              cd12203   1.20E-29  391    456  No hit       No description
Pfam             PF13837   1.0E-23   391    478  No hit       No description
Sequence
Protein Sequence    Length: 669 aa
MQQQKAGSQF VVPHSEMAPF SPSAVGSGAH LLGIPGPDPL QQSPMTEAAS PISSRTPARP  60
PTVDFDELAP AVAGNCPDDQ ALAGDEDAER GGGATGNRWP RQETLALLKI RSEMDAAFRD  120
ATFKGSLWEE VCRYSKPSQF PLLSSASTTR IFRSAASIPT NPYFRWRPWF QIPRCKKHAV  180
PPSSLRLLFT IIIIILGKLG ELGYKRSAKK CKEKFENVHK YYKRTKEGRA GRQDGKSYRF  240
FSQLEALYSG SSDGGATTST AKPAPAPPLR KHGGGSGASR KMMAFFDRLM NQVMERQDAM  300
QQRFLEAIEK RDQDRMIRDE AWRRQEMERL NREQELLAQE RVMAASRDTA IISYLQKISG  360
QTTPRAPAQS PQNQNECKQH HKSSEPMPSS SRWPKAEVHA LISLRSGLES KYQEAGPKGT  420
LWEEISAGMQ RLGYNRSAKR CKEKWENINK YFKKVKESNK KRPEDSKTCP YFHQLDAIYR  480
KKLLSSGGTS SGSGNIVGIQ RQQVQEANPP PNQQKSDAVT IMPQEQAPPP PQEQAGSKNG  540
KDGSSNNQNG GNSEGGEVSL GIQVPTSNGG LPSRFFGEGL NKSENFVKEL MGQRQQQAAM  600
DDDYAKLDEA DSDNMDQNDD NDDNDDDDEE DRKMQYTIQF QKQNVNNAGG SGGNGSAAAS  660
PGSFLAIVQ
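
The domain coordinates in the Signature Domain table can be checked against the sequence block above. The following is a minimal sketch, assuming the block is pasted as plain text with its running position counters; the helper names `parse_sequence_block` and `residues` are hypothetical and not part of any PlantTFDB tooling. It uses only the first three lines of the block (residues 1 to 180) and extracts trihelix domain 1 (residues 98 to 139, 1-based and inclusive).

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a position-numbered sequence block into one residue string."""
    # Keep letters only: this drops spaces, newlines, and the running
    # position counters (60, 120, ...) printed at the end of each line.
    return re.sub(r"[^A-Za-z]", "", block).upper()


def residues(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"range {start}-{end} is outside 1-{len(seq)}")
    return seq[start - 1:end]


# First three lines of this record's sequence block (residues 1-180).
BLOCK = """
MQQQKAGSQF VVPHSEMAPF SPSAVGSGAH LLGIPGPDPL QQSPMTEAAS PISSRTPARP  60
PTVDFDELAP AVAGNCPDDQ ALAGDEDAER GGGATGNRWP RQETLALLKI RSEMDAAFRD  120
ATFKGSLWEE VCRYSKPSQF PLLSSASTTR IFRSAASIPT NPYFRWRPWF QIPRCKKHAV  180
"""

seq = parse_sequence_block(BLOCK)
domain1 = residues(seq, 98, 139)

print(len(seq))
print(domain1)
```

The extracted segment should match the query row of the first trihelix alignment, which is a quick way to confirm that the Start and End columns of that table are 1-based and inclusive.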
Nuclear Localization Signal (NLS)
No.  Start  End  Sequence
1    204    212  KRSAKKCKE
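
The position of the NLS motif can be located directly in the protein sequence. The sketch below is illustrative only: it searches the first 240 residues of this record for `KRSAKKCKE` and reports both 0-based offsets and 1-based positions. In this entry, the table's Start and End values (204 and 212) coincide with the 0-based offsets of the first and last motif residues, whereas the 1-based positions are 205 and 213. That is an observation about this record, not a documented convention of the database.

```python
import re

# First four lines of this record's sequence block (residues 1-240).
BLOCK = """
MQQQKAGSQF VVPHSEMAPF SPSAVGSGAH LLGIPGPDPL QQSPMTEAAS PISSRTPARP  60
PTVDFDELAP AVAGNCPDDQ ALAGDEDAER GGGATGNRWP RQETLALLKI RSEMDAAFRD  120
ATFKGSLWEE VCRYSKPSQF PLLSSASTTR IFRSAASIPT NPYFRWRPWF QIPRCKKHAV  180
PPSSLRLLFT IIIIILGKLG ELGYKRSAKK CKEKFENVHK YYKRTKEGRA GRQDGKSYRF  240
"""

# Drop spaces, newlines, and the running position counters.
seq = re.sub(r"[^A-Za-z]", "", BLOCK)

motif = "KRSAKKCKE"
offset = seq.find(motif)  # 0-based offset of the first motif residue, -1 if absent

if offset == -1:
    raise SystemExit("motif not found in the excerpt")

last_offset = offset + len(motif) - 1  # 0-based offset of the last motif residue
first_pos = offset + 1                 # 1-based position of the first residue
last_pos = offset + len(motif)         # 1-based position of the last residue

print("0-based offsets:", offset, last_offset)
print("1-based positions:", first_pos, last_pos)
```

When comparing the NLS table with the Signature Domain table, it is therefore worth converting both to the same coordinate convention first.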
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  BT054872  1e-37    BT054872.1 Zea mays full-length cDNA clone ZM_BFc0165O02 mRNA, complete cds.
GenBank  BT067767  1e-37    BT067767.1 Zea mays full-length cDNA clone ZM_BFc0150E20 mRNA, complete cds.
Annotation -- Protein
Source  Hit ID                 E-value  Description
Refseq  XP_009398097.1         0.0      PREDICTED: trihelix transcription factor GTL1-like isoform X4
TrEMBL  M0SRT9                 0.0      M0SRT9_MUSAM; Uncharacterized protein
STRING  GSMUA_Achr4P24780_001  0.0      (Musa acuminata)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Monocots  OGMP60638175
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76890.2  7e-73    Trihelix family protein