PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_028103145.1
Organism Camellia sinensis
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; Ericales; Theaceae; Camellia; Camellia sinensis
Family EIL
Protein Properties Length: 595 aa    MW: 67355.8 Da    pI: 6.9862
Description EIL family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
XP_028103145.1    genome  NCBI    View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    EIN3    411    1.4e-125  40     411  1          349
                     XXXXXXXXXXXXXXXXXXXXXXX..XXXXX.XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX CS
            EIN3   1 eelkkrmwkdqmllkrlkerkkqlledkeaatgakksnksneqarrkkmsraQDgiLkYMlkemevcnaqGfvYgiipekgkpvegasdsLraWW 95 
                     e+l+krmwkd ++l+r+ker+k      +aa +++ ++ + ++arrkkm+ra DgiLkYMlk mevc+a+GfvYgiipekgkpv+gasd++raWW
  XP_028103145.1  40 EDLEKRMWKDHIKLRRIKERQKLAA--LQAAEKQEPKKMRLDHARRKKMARAHDGILKYMLKLMEVCKARGFVYGIIPEKGKPVSGASDNIRAWW 132
                     89*****************999644..33323333345778****************************************************** PP

                     XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX....XX----STTS-HHHHHHHHHHHSSSSSS-TTS--TTT--HHHH---S--HHHHHHT--TT- CS
            EIN3  96 kekvefdrngpaaiskyqaknlilsgesslqtersseshslselqDTtlgSLLsalmqhcdppqrrfplekgvepPWWPtGkelwwgelglskdq 190
                     kekv+fd+ngpaai+ky+a++l  ++e++   +++++++ l+elqD+tlgSLLs+lmqhc+ppqr++plekg++pPWWPtG+e+ww +lgl+k q
  XP_028103145.1 133 KEKVKFDKNGPAAIVKYEAECLTKEKEDGF--QKRNSQNILQELQDATLGSLLSSLMQHCNPPQRKYPLEKGIPPPWWPTGSEEWWVKLGLPKGQ 225
                     ********************9999888888..679999********************************************************* PP

                     -.-----GGG--HHHHHHHHHHHHHHTGGGHHHHHHTTTTSSSSTTT--SHHHHHHHHHHTTTTT-S--XXXXXX.........XXXXXXXXXXX CS
            EIN3 191 gtppykkphdlkkawkvsvLtavikhmsptieeirelerqskylqdkmsakesfallsvlnqeekecatvsahss.........slrkqspkvtl 276
                     + p ykkphdlkk wkv+vLtavikhmsp++++ir+l+r+sk+lqdkm+akes ++l vl++ee++++  s+++          s + +++k  +
  XP_028103145.1 226 S-PLYKKPHDLKKLWKVGVLTAVIKHMSPDVAKIRRLVRKSKCLQDKMTAKESSIWLGVLSREESLFQLPSSDNGasgmseppsSGHGEKKKPSV 319
                     *.9****************************************************************9999993344566777444448999999 PP

                     XXXXXXXXXXXXXXX.XXXXXXXXXX.......................XXXXXXXXXXXXXXXXXXXXX......XXXXXXX.XXXXXXXXXXX CS
            EIN3 277 sceqkedvegkkeskikhvqavktta.......................gfpvvrkrkkkpsesakvsskevsrtcqssqfrgsetelifadkns 348
                     s+++++dv+g ++   + +++++ ++                        ++ +  rk+++++s+++ ++       ss+  ++e++++++d n+
  XP_028103145.1 320 SSNSDYDVDGIDDGVSSVSSKDDLRDqpvevepssqpqnttphhfkdneRLEEQPSRKRRRVRSSSAAQQ----AAPSSEHLHHEPRNTLPDINQ 410
                     *********5555444555555555567778888889999999999876666666666666666665555....567888888999999999987 PP

                     X CS
            EIN3 349 i 349
                     +
  XP_028103145.1 411 T 411
                     6 PP

Sequence
Protein Sequence    Length: 595 aa
MDHFMIHANG LGDSSDVEMD EIRCENIAEK DVSDEEIKAE DLEKRMWKDH IKLRRIKERQ  60
KLAALQAAEK QEPKKMRLDH ARRKKMARAH DGILKYMLKL MEVCKARGFV YGIIPEKGKP  120
VSGASDNIRA WWKEKVKFDK NGPAAIVKYE AECLTKEKED GFQKRNSQNI LQELQDATLG  180
SLLSSLMQHC NPPQRKYPLE KGIPPPWWPT GSEEWWVKLG LPKGQSPLYK KPHDLKKLWK  240
VGVLTAVIKH MSPDVAKIRR LVRKSKCLQD KMTAKESSIW LGVLSREESL FQLPSSDNGA  300
SGMSEPPSSG HGEKKKPSVS SNSDYDVDGI DDGVSSVSSK DDLRDQPVEV EPSSQPQNTT  360
PHHFKDNERL EEQPSRKRRR VRSSSAAQQA APSSEHLHHE PRNTLPDINQ TDVPSPMCGG  420
RHKNDTTETM RPVEKGLQDP RLHIEIQNTR LLNGRNSGLH QDQRSKDSRM HHGPTHDFHS  480
LSLEFGSSSV GQQTQMGFSE PWVRPEDSGV NVPPLHENEN AIFEGDMHQY LKDTFQNDQD  540
KPLANHFGSP INSLSLDYGG FDSLFHLGIN DSGSLDTSDI DYLLKDDWML DCFGA
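
The blocked layout above (ten-residue groups with a running position count at the end of each line) has to be flattened before the sequence can be used in other tools. The following is a minimal sketch, assuming the text is copied exactly as shown; `parse_blocked_sequence` is a hypothetical helper name and is not part of PlantTFDB.

```python
import re

# Sequence block copied from the record above, including the running counters.
BLOCKED = """
MDHFMIHANG LGDSSDVEMD EIRCENIAEK DVSDEEIKAE DLEKRMWKDH IKLRRIKERQ  60
KLAALQAAEK QEPKKMRLDH ARRKKMARAH DGILKYMLKL MEVCKARGFV YGIIPEKGKP  120
VSGASDNIRA WWKEKVKFDK NGPAAIVKYE AECLTKEKED GFQKRNSQNI LQELQDATLG  180
SLLSSLMQHC NPPQRKYPLE KGIPPPWWPT GSEEWWVKLG LPKGQSPLYK KPHDLKKLWK  240
VGVLTAVIKH MSPDVAKIRR LVRKSKCLQD KMTAKESSIW LGVLSREESL FQLPSSDNGA  300
SGMSEPPSSG HGEKKKPSVS SNSDYDVDGI DDGVSSVSSK DDLRDQPVEV EPSSQPQNTT  360
PHHFKDNERL EEQPSRKRRR VRSSSAAQQA APSSEHLHHE PRNTLPDINQ TDVPSPMCGG  420
RHKNDTTETM RPVEKGLQDP RLHIEIQNTR LLNGRNSGLH QDQRSKDSRM HHGPTHDFHS  480
LSLEFGSSSV GQQTQMGFSE PWVRPEDSGV NVPPLHENEN AIFEGDMHQY LKDTFQNDQD  540
KPLANHFGSP INSLSLDYGG FDSLFHLGIN DSGSLDTSDI DYLLKDDWML DCFGA
"""


def parse_blocked_sequence(text: str) -> str:
    """Keep only the residue letters; spaces, newlines and counters are dropped."""
    return "".join(re.findall(r"[A-Za-z]+", text)).upper()


sequence = parse_blocked_sequence(BLOCKED)

# The flattened length can be compared with the "Length: 595 aa" field.
print(len(sequence))
```

Comparing the flattened length with the stated protein length is a quick way to catch a dropped or duplicated block after copying.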
Nuclear Localization Signal (NLS)
No.  Start  End  Sequence
1    376    380  RKRRR
2    377    381  KRRRV
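
The two NLS hits above are given as 1-based, inclusive coordinates (376 to 380 and 377 to 381) on the protein sequence. As a small sanity check, the sketch below maps those coordinates onto the sequence row that covers residues 361 to 420; `ROW`, `ROW_START` and `residues` are illustrative names chosen here, not anything defined by PlantTFDB.

```python
# Sequence row covering residues 361-420, copied from the Sequence section
# with the spaces between ten-residue groups removed.
ROW_START = 361  # 1-based position of the first residue in ROW
ROW = "PHHFKDNERLEEQPSRKRRRVRSSSAAQQAAPSSEHLHHEPRNTLPDINQTDVPSPMCGG"


def residues(start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) taken from ROW."""
    if start < ROW_START or end >= ROW_START + len(ROW) or start > end:
        raise ValueError("coordinates fall outside the copied row")
    # Convert 1-based inclusive coordinates to a 0-based half-open slice.
    return ROW[start - ROW_START : end - ROW_START + 1]


# (start, end, motif) as listed in the NLS table.
nls_hits = [(376, 380, "RKRRR"), (377, 381, "KRRRV")]
for start, end, motif in nls_hits:
    print(start, end, residues(start, end), residues(start, end) == motif)
```

The two hits overlap by four residues, so together they describe a single basic stretch, RKRRRV, at positions 376 to 381.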
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G73730.1  1e-153   EIL family protein