PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thhalv10010018m
Common Name EUTSA_v10010018mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema
Family Trihelix
Protein Properties Length: 439 aa    MW: 50382.8 Da    pI: 6.346
Description Trihelix family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Thhalv10010018m  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  85     9.7e-27  113    207  1          85
         trihelix   1 rWtkqevlaLiearremeerlrrgk.........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcpyfdq 84 
                      +Wt+++v++Li+a++++++++  +          +kk++W++vsk+m+erg+++sp+qC++k+++lnkrykk++++ +++ +++++++ +++d+
  Thhalv10010018m 113 KWTDKMVKLLITAVSYIGDDSTMDTgsrrkfavlQKKGKWKSVSKVMAERGYHVSPQQCEDKFNDLNKRYKKLNDMLGRGtSCQVVENPALLDS 206
                      7**************8777666443336667788**********************************************66999999888876 PP

         trihelix  85 l 85 
                      +
  Thhalv10010018m 207 I 207
                      5 PP

Protein Features
3D Structure
Database  Entry ID  E-value  Start  End  InterPro ID  Description
Pfam      PF13837   1.0E-24  111    234  No hit       No description
Sequence
Protein Sequence    Length: 439 aa
MDGNFPQGSV VRGGASSYGG FDLQGPTRVH HPDSMNQHRH NPNSRTLHEG LPFTMVTGQT  60
CDHHNMSITE QHKGEREKIS VSDDDEPSFT EEGGDGHNEA NKSTKGSPWQ RVKWTDKMVK  120
LLITAVSYIG DDSTMDTGSR RKFAVLQKKG KWKSVSKVMA ERGYHVSPQQ CEDKFNDLNK  180
RYKKLNDMLG RGTSCQVVEN PALLDSIGYL NDKEKDDVRK IMSSKHLFYE EMCSYHNGNR  240
LHLPHDLALQ RSLQLALRNR DDHDNDDSRK HQMMEDVDDE DHDGEGDEHD EYEEQHFSHG  300
DCRGVHYGGG GPLKKMRQSH SHEDADHPNH VNSLECSRGS LPQMPFPQAD VNQGGAESGR  360
AASVQKQWIE SRTLQLEEQK LQIQVELLEL EKQRFRWERF SKKRDQELER MRMENERMKL  420
ENDRMGLELK QRELGVEL*
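
The signature-domain coordinates listed above (Start 113, End 207) can be checked against the protein sequence. The following is a minimal sketch, not part of PlantTFDB: the helper names `clean_sequence` and `region` are hypothetical, and it assumes the coordinates are 1-based and inclusive, as the alignment block suggests.

```python
import re

# Sequence block as shown in this record, including the grouping spaces,
# the running position counters and the trailing stop marker '*'.
RAW = """
MDGNFPQGSV VRGGASSYGG FDLQGPTRVH HPDSMNQHRH NPNSRTLHEG LPFTMVTGQT  60
CDHHNMSITE QHKGEREKIS VSDDDEPSFT EEGGDGHNEA NKSTKGSPWQ RVKWTDKMVK  120
LLITAVSYIG DDSTMDTGSR RKFAVLQKKG KWKSVSKVMA ERGYHVSPQQ CEDKFNDLNK  180
RYKKLNDMLG RGTSCQVVEN PALLDSIGYL NDKEKDDVRK IMSSKHLFYE EMCSYHNGNR  240
LHLPHDLALQ RSLQLALRNR DDHDNDDSRK HQMMEDVDDE DHDGEGDEHD EYEEQHFSHG  300
DCRGVHYGGG GPLKKMRQSH SHEDADHPNH VNSLECSRGS LPQMPFPQAD VNQGGAESGR  360
AASVQKQWIE SRTLQLEEQK LQIQVELLEL EKQRFRWERF SKKRDQELER MRMENERMKL  420
ENDRMGLELK QRELGVEL*
"""


def clean_sequence(raw: str) -> str:
    """Keep only the residue letters; spaces, counters and '*' are dropped."""
    return "".join(re.findall(r"[A-Z]", raw))


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
domain = region(seq, 113, 207)  # trihelix signature domain per this record

print(domain[:8], "...", domain[-7:])
```

The first and last residues of the extracted region can then be compared with the `Thhalv10010018m` rows of the alignment in the Signature Domain section.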
Cis-element
Source       Link
PlantRegMap  Thhalv10010018m
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AY045850  0.0      AY045850.1 Arabidopsis thaliana unknown protein (At1g21200) mRNA, complete cds.
GenBank  AY091378  0.0      AY091378.1 Arabidopsis thaliana unknown protein (At1g21200) mRNA, complete cds.
GenBank  CP002684  0.0      CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
GenBank  F16F4     0.0      AC036104.3 Sequence of BAC F16F4 from Arabidopsis thaliana chromosome 1, complete sequence.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_006416324.1  0.0      uncharacterized protein LOC18992820
TrEMBL  V4KX91          0.0      V4KX91_EUTSA; Uncharacterized protein
STRING  XP_006416324.1  0.0      (Eutrema salsugineum)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM50422752
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G21200.1  0.0      sequence-specific DNA binding transcription factors