PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thhalv10020471m
Common Name EUTSA_v10020471mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema
Family Trihelix
Protein Properties Length: 542aa    MW: 59004.9 Da    PI: 6.8727
Description Trihelix family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Thhalv10020471m  genome  JGI     View CDS
Signature Domain
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  61.6   1.9e-19  155    241  1          86
         trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm..rergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                      +W++  + +L++a++++ ++l+rg+l++ +Weev++ +  r +++ +s +qCk+k++nl+kryk ++++++++   s s++p+f+++e
  Thhalv10020471m 155 EWSDAAISCLLDAYSDKFTQLNRGNLRGRDWEEVASSVskRCEKLIKSVEQCKNKIDNLKKRYKLERHRMSSG-GISASHWPWFSKME 241
                      5*************************************99999*****************************9.566679*******8 PP

Protein Features
Database     Entry ID            E-value    Start  End  InterPro ID  Description
Pfam         PF13837             2.7E-21    153    243  No hit       No description
Gene3D       G3DSA:3.40.1160.10  1.1E-63    288    533  IPR001048    Aspartate/glutamate/uridylate kinase
SuperFamily  SSF53633            1.11E-50   296    533  IPR001048    Aspartate/glutamate/uridylate kinase
CDD          cd04254             1.40E-116  298    534  No hit       No description
Pfam         PF00696             3.2E-21    299    513  IPR001048    Aspartate/glutamate/uridylate kinase
Gene Ontology
GO Term     GO Category         GO Description
GO:0005737  Cellular Component  cytoplasm
GO:0016740  Molecular Function  transferase activity
Sequence
Protein Sequence    Length: 542 aa
MASCDDDFSL LGEDQSNPNQ QHHHHQVLHH APYAPRRFTP KPSNQILVPH QQRNGDEDDE  60
NEVVVEASSA FHGVNPFSAD ENSNPYDNNA AVDGDEELDA NRSRIGGLRV EKRQSQEELS  120
DGGTTNGGEN TPYGSFKRPR TSSSSAGEYR KDREEWSDAA ISCLLDAYSD KFTQLNRGNL  180
RGRDWEEVAS SVSKRCEKLI KSVEQCKNKI DNLKKRYKLE RHRMSSGGIS ASHWPWFSKM  240
EEIVGNSLAT KGASDEDRSG SSLGNAVKPA KRYPLVTYSP GVQINNVKSK ATSNPRWRRV  300
VLKISGAALA CTGPNNIDPK VVNLIAREVA MACRLGVEVA IVVGSRNFFC GGTWITATGM  360
DRTTAYHISM MASVMNSVLL QSSLEKMGVQ ARLQTAIAVQ GVGEPYNRQR ATRHLDKGRV  420
VIFGGIGATL GNPLLSSDAS AALRAIDILS SVNAEAMVKG TNVDGVYDCH SQDSNVTFEH  480
ITFRELASRG LTTMDTMALN VCEENSIPVV VFNFLEAGNI TKALCGEQVG TLIDITGRGV  540
S*
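The sequence block above is laid out in groups of ten residues with a cumulative counter at the end of each full line. As a minimal sketch of how such a block could be joined back into a single string, the Python below strips the counters and group spacing. The function name `parse_sequence_block` is hypothetical and not part of any PlantTFDB tool, and treating the trailing `*` as a stop marker rather than a residue is an assumption. Only the first and last lines of the block are used to keep the example short.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a numbered, space-grouped sequence block into one string.

    Hypothetical helper for illustration; not part of any PlantTFDB tool.
    """
    parts = []
    for line in text.splitlines():
        # Drop the trailing cumulative residue counter, if the line has one.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces that separate the groups of ten residues.
        parts.append(re.sub(r"\s+", "", line))
    return "".join(parts)


# First and last lines of the record's sequence block, copied verbatim.
block = """MASCDDDFSL LGEDQSNPNQ QHHHHQVLHH APYAPRRFTP KPSNQILVPH QQRNGDEDDE  60
S*"""

seq = parse_sequence_block(block)
# Assumption: '*' marks the translated stop codon and is not a residue.
protein = seq.rstrip("*")
print(len(seq), len(protein))
```

Applied to the full block, the line counters indicate 540 residues before the final line `S*`, so the stated length of 542 may include the terminal `*`; this is an inference from the layout, not something the record states.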
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
3ek5_A  4e-57   289          533        1          239      Uridylate kinase
3ek5_B  4e-57   289          533        1          239      Uridylate kinase
3ek5_C  4e-57   289          533        1          239      Uridylate kinase
3ek5_D  4e-57   289          533        1          239      Uridylate kinase
3ek5_E  4e-57   289          533        1          239      Uridylate kinase
3ek5_F  4e-57   289          533        1          239      Uridylate kinase
3ek6_A  4e-57   289          533        1          239      Uridylate kinase
3ek6_B  4e-57   289          533        1          239      Uridylate kinase
3ek6_C  4e-57   289          533        1          239      Uridylate kinase
3ek6_D  4e-57   289          533        1          239      Uridylate kinase
3ek6_E  4e-57   289          533        1          239      Uridylate kinase
3ek6_F  4e-57   289          533        1          239      Uridylate kinase
Binding Motif
Motif ID  Method  Source                   Motif file
MP00340   DAP     Transfer from AT3G10030  Download
Cis-element
Source       Link
PlantRegMap  Thhalv10020471m
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AF375431  0.0      AF375431.1 Arabidopsis thaliana AT3g10030/T22K18_15 mRNA, complete cds.
GenBank  AY120687  0.0      AY120687.1 Arabidopsis thaliana AT3g10030/T22K18_15 mRNA, complete cds.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_006407614.1  0.0      uncharacterized protein LOC18023904 isoform X1
TrEMBL  V4M2L5          0.0      V4M2L5_EUTSA; Uncharacterized protein
STRING  XP_006407614.1  0.0      (Eutrema salsugineum)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM9305              26           31
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G10030.1  0.0      Trihelix family protein