PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PHT41004.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family EIL
Protein Properties Length: 711aa    MW: 80099.2 Da    PI: 5.4544
Description EIL family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PHT41004.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1EIN3513.49.9e-157514261354
                 XXXXXXXXXXXXXXXXXXXXXXX..XXXXX.XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX CS
        EIN3   1 eelkkrmwkdqmllkrlkerkkqlledkeaatgakksnksneqarrkkmsraQDgiLkYMlkemevcnaqGfvYgiipekgkpvegasdsLraWWkekv 99 
                 +el++rmw+d+ml++rlke+ k+    ke+  + +k+++s+eqarrkkmsraQDgiLkYMlk+mevcnaqGfvYgiipekgkpv+gasd+LraWWkekv
  PHT41004.1  51 DELERRMWRDRMLMRRLKEKIKN----KEV-GDGAKQRQSQEQARRKKMSRAQDGILKYMLKMMEVCNAQGFVYGIIPEKGKPVTGASDNLRAWWKEKV 144
                 79***************999997....787.788899************************************************************** PP

                 XXXXXXXXXXXXXXXXXXXXXXXXXX....XX----STTS-HHHHHHHHHHHSSSSSS-TTS--TTT--HHHH---S--HHHHHHT--TT--.-----G CS
        EIN3 100 efdrngpaaiskyqaknlilsgesslqtersseshslselqDTtlgSLLsalmqhcdppqrrfplekgvepPWWPtGkelwwgelglskdqgtppykkp 198
                 +fdrngpaai+kyqa+n i+++ +++++   s++h+l+elqDTtlgSLLsalmqhcdppqrrfplekgv+pPWWP+Gke+wwg+lgl++dq +ppykkp
  PHT41004.1 145 RFDRNGPAAIAKYQADNQIPGRVEESSV-IVSTPHTLQELQDTTLGSLLSALMQHCDPPQRRFPLEKGVAPPWWPSGKEDWWGQLGLPNDQIQPPYKKP 242
                 **********************999988.********************************************************************** PP

                 GG--HHHHHHHHHHHHHHTGGGHHHHHHTTTTSSSSTTT--SHHHHHHHHHHTTTTT-S--XXXX..XX.XXXXXXXXXXXXXXXXXXXXXXXXXX.XX CS
        EIN3 199 hdlkkawkvsvLtavikhmsptieeirelerqskylqdkmsakesfallsvlnqeekecatvsah..ss.slrkqspkvtlsceqkedvegkkeskikh 294
                 hdlkkawkv+vLtavikh+sp+i++ir+l+rqsk+lqdkm+akes+++l+++nqee++++++++h  ++ sl  ++ +v++s+ +++dveg ++++ ++
  PHT41004.1 243 HDLKKAWKVGVLTAVIKHISPDIAKIRKLVRQSKCLQDKMTAKESATWLAIINQEEALARKLYPHiyPQgSLAIGNGSVFISDASDYDVEGVEDERNNE 341
                 *****************************************************************874469999****************666666455 PP

                 XXXXXXXX.............................XXXXXXXXXXXXXXXXXXXXX......XXXXXXX.XXXXXXXXXXXXXXXXX CS
        EIN3 295 vqavktta.............................gfpvvrkrkkkpsesakvsskevsrtcqssqfrgsetelifadknsisqney 354
                 v++ k ++                             + ++++krk +p  +++v +k  ++tc+  ++++++++++f+d++ ++++++
  PHT41004.1 342 VEC-KPHDinlqtgimvpkdgilmpafapvkgeiidlTSDFTQKRK-QPYFEESVDQK--IYTCEYLHCPYNNYQAGFHDRTTRNNHQI 426
                 555.444558***************************9*******9.67777777777..7**************************85 PP

Sequence ? help Back to Top
Protein Sequence    Length: 711 aa     Download sequence    
MGIFEDMGFS GNFDLLSDPL GCGGDVAQEV EHKPVGLVEE DDYSDEEMDV DELERRMWRD  60
RMLMRRLKEK IKNKEVGDGA KQRQSQEQAR RKKMSRAQDG ILKYMLKMME VCNAQGFVYG  120
IIPEKGKPVT GASDNLRAWW KEKVRFDRNG PAAIAKYQAD NQIPGRVEES SVIVSTPHTL  180
QELQDTTLGS LLSALMQHCD PPQRRFPLEK GVAPPWWPSG KEDWWGQLGL PNDQIQPPYK  240
KPHDLKKAWK VGVLTAVIKH ISPDIAKIRK LVRQSKCLQD KMTAKESATW LAIINQEEAL  300
ARKLYPHIYP QGSLAIGNGS VFISDASDYD VEGVEDERNN EVECKPHDIN LQTGIMVPKD  360
GILMPAFAPV KGEIIDLTSD FTQKRKQPYF EESVDQKIYT CEYLHCPYNN YQAGFHDRTT  420
RNNHQIICPF RFNSAQTLGT PNYQINNENN RGFPAQIAPP KPAVSSVTAS SSMSFSGLRL  480
PEDDHRMISD LITSYDSNFQ HNGSICSGNS EILVNQNMQQ QQSVEVPMDD NFNLGHMETA  540
AQGNSMPVNS AYRSTQFQYD QCKLPFDAPL TGNLNDITDF RFGSPFNLGG NDYSMDELTK  600
QDISTCYVEL IDPPPSSTSH RPRRGRPRKI PILISILSEN SNPENEKTLE GINGVAASVI  660
SYEDTTMLNA RVVPEMDAIV KKSNNTISPF LENHENKHDT ISWQTTVQTG C
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1623629RRGRPRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G20770.10.0EIL family protein