PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Neem_7058_f_2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family Trihelix
Protein Properties Length: 717aa    MW: 79158.7 Da    PI: 10.188
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Neem_7058_f_2genomeNGDView Nucleic Acid
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix94.97.6e-3084168187
       trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                    rW++qe+laL++ r++m+ ++r++  k+plWe+vs+k++e g++rs+k+Ckek+en++k+yk++keg+ +r++++s  +++f+qlea
  Neem_7058_f_2  84 RWPRQETLALLKTRSDMDAAFRDATVKGPLWEDVSRKLAELGYKRSAKKCKEKFENVHKYYKRTKEGRAGRQDGKS--YKFFSQLEA 168
                    8*********************************************************************866665..*******85 PP

2trihelix1071.3e-33448533187
       trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                    rW+k evlaLi++r+ +e r++++  k+plWee+s+ m++ g++rs+k+Ckekwen+nk++kk+ke++kkr +e+ +tcpyf++l+a
  Neem_7058_f_2 448 RWPKVEVLALIKLRSGLEPRYQEAGPKGPLWEEISAGMQRMGYNRSAKRCKEKWENINKYFKKVKESNKKR-PEDAKTCPYFHELDA 533
                    8*********************************************************************8.99999********85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.05981143IPR001005SANT/Myb domain
CDDcd122034.72E-2583148No hitNo description
PfamPF138372.8E-2083169No hitNo description
PROSITE profilePS500906.4783141IPR017877Myb-like domain
SMARTSM007178.2E-4445507IPR001005SANT/Myb domain
PROSITE profilePS500906.795447505IPR017877Myb-like domain
CDDcd122031.83E-29447512No hitNo description
PfamPF138374.3E-23447534No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 717 aa     Download sequence    Send to blast
MQQEGGGTQS QYGEMTPAAT TTATPTTTTH ITQQQQPVES ASPISSRPPA ATSTTSNFDE  60
FMKLSGGGDE DEGDRAGGVA SGNRWPRQET LALLKTRSDM DAAFRDATVK GPLWEDVSRK  120
LAELGYKRSA KKCKEKFENV HKYYKRTKEG RAGRQDGKSY KFFSQLEALH SSTATTSNVP  180
VSLQMPVTTV TSSNAITLDV APVSIGIPMP ISSVRIPASH SPSTLGQPSN TAGSSRKRKR  240
QSSTTSSSTP RMMEFFESLM KQVMQKQETM QKRFLEVIEK REQDRMIREE AWKRQEMARL  300
AREHELMAQE RAISASRDSA IISFLQKITG QTIQLPPAVT IPAAPPPPAP AQAAPVVVLP  360
AVSLPTTTQH HVTPSSLPPE RRDQQNQQQM QSHHRHQQQQ QVKSAEVVRH QTASTSIPSE  420
VVMAIPEQQI PPSQEIASGG SFEPSSSRWP KVEVLALIKL RSGLEPRYQE AGPKGPLWEE  480
ISAGMQRMGY NRSAKRCKEK WENINKYFKK VKESNKKRPE DAKTCPYFHE LDALYRKKIL  540
GGGSTSGGGS STGAGSFNIS EERQPQQHQQ ESQKLDNTPA ATNPQESSNV STITPAPQLM  600
PVSESENKNA GSVDAQASSA GLPGSLFGEG NGGASNKPED IVKELMNQQG MQQQHQQPQT  660
SVGNDFDKAK RKRLNQGRQN KLAQQQQQLE QQERTKNIIK EESRKKNLVV WRAYTLS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1126134KRSAKKCKE
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankKU9477021e-70KU947702.1 Toxicodendron vernicifluum microsatellite c24641 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006439158.10.0trihelix transcription factor GTL1 isoform X1
SwissprotQ391173e-64TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLA0A2H5PXW20.0A0A2H5PXW2_CITUN; Uncharacterized protein
STRINGXP_006439158.10.0(Citrus clementina)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM62262746
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.12e-66Trihelix family protein