PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thhalv10018297m
Common Name EUTSA_v10018297mg
Organism Eutrema salsugineum
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema
Family Trihelix
Protein Properties Length: 613 aa    MW: 68340.7 Da    pI: 6.0802
Description Trihelix family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Thhalv10018297m  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  91.1   1.2e-28  57     141  1          87
         trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                      rW++qe+laL+++r++m+ ++r+++ k+plWeevs+km+e g+ r++k+Ckek+en+ k++k++keg+ ++++++  t+++fdqlea
  Thhalv10018297m  57 RWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAELGYIRNAKKCKEKFENVYKYHKRTKEGRTGKSEGK--TYRFFDQLEA 141
                      8********************************************************************975544..6*******85 PP

2    trihelix  105.2  4.6e-33  411    496  1          87
         trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                      rW+k e+ aLi++r++++++++++  k+plWee+s+ mr+ gf+r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ql+a
  Thhalv10018297m 411 RWPKVEIEALIKLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFHQLDA 496
                      8*********************************************************************8.99***********85 PP
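The Start and End columns of the Signature Domain table can be checked against the protein sequence. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption that happens to agree with the alignment rows in this record). The names `FRAGMENT`, `FRAGMENT_OFFSET`, and `slice_region` are illustrative and not part of PlantTFDB.

```python
# Residues 51-150 of Thhalv10018297m, copied from the Sequence section of this record.
FRAGMENT = (
    "RGFGGNRWPR"                                                     # 51-60
    "QETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAELGYIRNAKKCKEKFENVYKYHKRT"   # 61-120
    "KEGRTGKSEGKTYRFFDQLEALETQSTSSL"                                 # 121-150
)
FRAGMENT_OFFSET = 50  # number of residues that precede the fragment


def slice_region(fragment, offset, start, end):
    """Return residues start..end (assumed 1-based, inclusive) from a
    fragment whose first residue is at position offset + 1."""
    return fragment[start - 1 - offset:end - offset]


# Domain 1 of the table: Start 57, End 141.
domain1 = slice_region(FRAGMENT, FRAGMENT_OFFSET, 57, 141)

# Query row of the first alignment block, with its gap characters.
aligned_row = (
    "RWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAELGYIRNAKKCKEKFENVYKYHKRT"
    "KEGRTGKSEGK--TYRFFDQLEA"
)

print(len(domain1))
print(domain1 == aligned_row.replace("-", ""))
```

Removing the gap characters from the alignment row and comparing it with the slice is a quick way to confirm that the coordinate convention assumed here fits the data.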

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SMART            SM00717           0.12      54     116  IPR001005    SANT/Myb domain
Pfam             PF13837           6.0E-18   56     142  No hit       No description
PROSITE profile  PS50090           6.957     56     114  IPR017877    Myb-like domain
CDD              cd12203           2.81E-22  56     121  No hit       No description
PROSITE profile  PS50090           7.189     404    468  IPR017877    Myb-like domain
SMART            SM00717           0.0017    408    470  IPR001005    SANT/Myb domain
Pfam             PF13837           3.4E-22   410    497  No hit       No description
Gene3D           G3DSA:1.10.10.60  8.5E-4    410    467  IPR009057    Homeodomain-like
CDD              cd12203           6.26E-27  411    475  No hit       No description
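The Protein Features rows fall into two clusters that line up with the two trihelix signature domains. The sketch below groups the rows by the signature domain they overlap, using only the Start and End values listed in this record. The function names `overlaps` and `group_by_domain` are hypothetical helpers introduced for illustration.

```python
# (Start, End) of each signature domain, keyed by its No. in the Signature Domain table.
SIGNATURE_DOMAINS = {1: (57, 141), 2: (411, 496)}

# (database, entry ID, start, end) for each row of the Protein Features table.
FEATURES = [
    ("SMART", "SM00717", 54, 116),
    ("Pfam", "PF13837", 56, 142),
    ("PROSITE profile", "PS50090", 56, 114),
    ("CDD", "cd12203", 56, 121),
    ("PROSITE profile", "PS50090", 404, 468),
    ("SMART", "SM00717", 408, 470),
    ("Pfam", "PF13837", 410, 497),
    ("Gene3D", "G3DSA:1.10.10.60", 410, 467),
    ("CDD", "cd12203", 411, 475),
]


def overlaps(a_start, a_end, b_start, b_end):
    """True if two inclusive intervals share at least one position."""
    return a_start <= b_end and b_start <= a_end


def group_by_domain(domains, features):
    """Map each signature domain number to the feature rows that overlap it."""
    grouped = {number: [] for number in domains}
    for database, entry_id, start, end in features:
        for number, (d_start, d_end) in domains.items():
            if overlaps(start, end, d_start, d_end):
                grouped[number].append((database, entry_id))
    return grouped


grouped = group_by_domain(SIGNATURE_DOMAINS, FEATURES)
for number, hits in grouped.items():
    print(number, [database for database, _ in hits])
```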
Gene Ontology
GO Term     GO Category         GO Description
GO:0010192  Biological Process  mucilage biosynthetic process
GO:0044212  Molecular Function  transcription regulatory region DNA binding
Sequence
Protein Sequence    Length: 613 aa
MMQLGGTTTP AASSAATAAA PPQSNDSAAT EAAAAAAAVG AFEVSEEMSD RGFGGNRWPR  60
QETLALLKIR SDMGIAFRDA SVKGPLWEEV SRKMAELGYI RNAKKCKEKF ENVYKYHKRT  120
KEGRTGKSEG KTYRFFDQLE ALETQSTSSL HHQQQQPPQP QPQPLQPPLN NNNNSSLFST  180
PPPVTTVMPP MTSITLPPSS IPPYTQPVNI PSFPNISGDF LSDNSTSSSS SYSTSSDVEI  240
GGTTASRKKR KRKWKDFFER LMKQVVDKQE ELQRKFLEAV EKREHERLVR EETWRVQEIA  300
RINREHEILA QERSMSAAKD AAVMAFLQKL SEKPNPQGQP IAPQPQQTRS QMQVNNHQQQ  360
TPQRPPPPPP LPQPTQPVTP TLDATKTDNG DQNMTPASAS AAGGAAASSS RWPKVEIEAL  420
IKLRTNLDSK YQENGPKGPL WEEISAGMRR LGFNRNSKRC KEKWENINKY FKKVKESNKK  480
RPEDSKTCPY FHQLDALYRE RNKLHSNNNN NNIASSSSTS GLIKPDDSVP LMVQPEQQWP  540
PATAATTTTT VAAAQTDNHP HPSQPSDQNF EDEEGTDEEE YDEEEEEEEN EEEEGGEFEL  600
VPSNNNKTTN NL*
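The sequence listing above is broken into 10-residue blocks with a running position counter at the end of each line and a `*` stop marker at the end. Before using it in other tools it usually needs to be joined into one string. A minimal sketch, assuming this layout; `parse_blocked_sequence` is an illustrative name, and only the first two lines of the listing are used as input here.

```python
import re


def parse_blocked_sequence(text):
    """Join a blocked sequence listing into one string.

    Spaces and the trailing position counters are discarded, and a
    terminal '*' stop marker is stripped.
    """
    parts = []
    for line in text.splitlines():
        # Keep letters and '*' only; digits and whitespace are dropped.
        parts.append(re.sub(r"[^A-Za-z*]", "", line))
    return "".join(parts).rstrip("*")


listing = """\
MMQLGGTTTP AASSAATAAA PPQSNDSAAT EAAAAAAAVG AFEVSEEMSD RGFGGNRWPR  60
QETLALLKIR SDMGIAFRDA SVKGPLWEEV SRKMAELGYI RNAKKCKEKF ENVYKYHKRT  120
"""

sequence = parse_blocked_sequence(listing)
print(len(sequence))
print(sequence[:10])
```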
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    246    251  RKKRKR
2    246    252  RKKRKRK
3    247    252  KKRKRK
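The NLS coordinates can be compared with the protein sequence in the same way as the domain coordinates. The sketch below locates each listed motif in residues 241-260 of the sequence and reports its position as a 0-based offset with an inclusive end. Under that convention the computed positions coincide with the Start and End values in the table, which suggests (but does not prove) that this table is 0-based, unlike the Signature Domain table. `locate` and the constant names are hypothetical.

```python
# Residues 241-260 of the protein, copied from the Sequence section of this record.
FRAGMENT = "GGTTASRKKRKRKWKDFFER"
FRAGMENT_OFFSET = 240  # number of residues that precede the fragment

# (No., Start, End, Sequence) as listed in the NLS table.
NLS_ROWS = [
    (1, 246, 251, "RKKRKR"),
    (2, 246, 252, "RKKRKRK"),
    (3, 247, 252, "KKRKRK"),
]


def locate(motif, fragment, offset):
    """Return (start, end) of the first match in full-protein coordinates,
    0-based with an inclusive end, or None if the motif is absent."""
    index = fragment.find(motif)
    if index < 0:
        return None
    start = offset + index
    return start, start + len(motif) - 1


for number, start, end, motif in NLS_ROWS:
    found = locate(motif, FRAGMENT, FRAGMENT_OFFSET)
    print(number, motif, found, found == (start, end))
```

To convert these positions to the 1-based convention, add 1 to both Start and End.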
Functional Description
Source   Description
UniProt  Probable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Binding Motif
Motif ID  Method  Source                   Motif file
MP00243   DAP     Transfer from AT1G76880  Download
Motif logo
Cis-element
Source       Link
PlantRegMap  Thhalv10018297m
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_006390148.1  0.0      trihelix transcription factor GT-2
Swissprot  Q39117          1e-146   TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBL     V4K8S0          0.0      V4K8S0_EUTSA; Uncharacterized protein
STRING     XP_006390148.1  0.0      (Eutrema salsugineum)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM4849              25           53
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76880.1  1e-139   Trihelix family protein