PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thhalv10018336m
Common NameEUTSA_v10018336mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema
Family Trihelix
Protein Properties Length: 581aa    MW: 66076.4 Da    PI: 7.8327
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thhalv10018336mgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix957.1e-3043127187
         trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                      rW++ e+laL+++r+em++++r++ lk+plWe++s+km+e g++rs+k+Ckek+en+ k++k++keg+ ++++++  t+++f++lea
  Thhalv10018336m  43 RWPRPETLALLRIRSEMDKAFRDSTLKAPLWEKISRKMMELGYKRSAKKCKEKFENVYKYHKRTKEGRTGKSEGK--TYRFFEELEA 127
                      8********************************************************************975544..6******985 PP

2trihelix108.83.5e-34401486187
         trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                      rW+k ev aLi++r+++e +++++  k+plWee+s+ mr+ g++rs+k+Ckekwen+nk++kk+ke++kkr + +s+tcpyf+qlea
  Thhalv10018336m 401 RWPKTEVEALIRIRKNLEANYQENGTKGPLWEEISAGMRRLGYNRSAKRCKEKWENINKYFKKVKESNKKR-PLDSKTCPYFHQLEA 486
                      8*********************************************************************8.9************85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.02140102IPR001005SANT/Myb domain
CDDcd122035.17E-2542107No hitNo description
PROSITE profilePS500906.93442100IPR017877Myb-like domain
PfamPF138378.9E-2042127No hitNo description
PROSITE profilePS500907.526394458IPR017877Myb-like domain
SMARTSM007179.2E-4398460IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.602.9E-4400457IPR009057Homeodomain-like
CDDcd122031.07E-26400465No hitNo description
PfamPF138377.2E-24400487No hitNo description
SuperFamilySSF466893.97E-5401493IPR009057Homeodomain-like
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 581 aa     Download sequence    Send to blast
MSGNSSGLLD SSGGGGVGGS GEEEKDMKME ETGEGGGGGG GNRWPRPETL ALLRIRSEMD  60
KAFRDSTLKA PLWEKISRKM MELGYKRSAK KCKEKFENVY KYHKRTKEGR TGKSEGKTYR  120
FFEELEAFET LNSYQPEPES QPENSHAATM TTTSLTPWIS SNNPPADKSS SPLKHHHQVS  180
VKPITTNPTF IAKQPSPTTP FPFYSNNHTT TADTGFKPTS NDLVKNVSSA LNLFSSSTSS  240
STASDDEDGH HQGKRSRKRR KYWKGLFTKL TKELMEKQEK MQRRFLETLE NRERERISRE  300
EAWRVQEVAR INREQETLVH ERSSAAAKDA AIISFLHKIS GGQQQPQQHA HQNHKVSQRK  360
QYQSDHSITF ESKEPRPVLL DTTMKMGNYD TNHSVSPSSS RWPKTEVEAL IRIRKNLEAN  420
YQENGTKGPL WEEISAGMRR LGYNRSAKRC KEKWENINKY FKKVKESNKK RPLDSKTCPY  480
FHQLEALYNE RNKSGVLPLP LPLMVTPQRQ LLLPHETQNE SETDHMDQVG NKEGEEEGES  540
EEDGYEEEEE EGEGDNETSE FEIVLNKSSS PMDINSNIFT *
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
18593KRSAKKCKE
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00244DAPTransfer from AT1G76890Download
Motif logo
Cis-element ? help Back to Top
SourceLink
PlantRegMapThhalv10018336m
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAY0871170.0AY087117.1 Arabidopsis thaliana clone 3190 mRNA, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006390147.10.0trihelix transcription factor GT-2
SwissprotQ391170.0TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLV4KJH00.0V4KJH0_EUTSA; Uncharacterized protein
STRINGXP_006390147.10.0(Eutrema salsugineum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM59952847
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.20.0Trihelix family protein