PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_023550422.1
Organism Cucurbita pepo
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Cucurbiteae; Cucurbita; Cucurbita pepo
Family EIL
Protein Properties Length: 618 aa    MW: 70219.1 Da    pI: 5.4542
Description EIL family protein
Gene Model
Gene Model ID     Type     Source   Coding Sequence
XP_023550422.1    genome   NCBI     View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    EIN3    515.5  2.3e-157  51     429  1          353
                     XXXXXXXXXXXXXXXXXXXXXXX..XXXXX.XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX CS
            EIN3   1 eelkkrmwkdqmllkrlkerkkqlledkeaatgakksnksneqarrkkmsraQDgiLkYMlkemevcnaqGfvYgiipekgkpvegasdsLraWW 95 
                     +el++rmw+d+mll+rlke++k+    ke+ ++++k+++s+eqarrkkmsraQD iLkYMlk+mevc+aqGfvYgiipekgkpv+gasd+LraWW
  XP_023550422.1  51 DELERRMWRDRMLLRRLKEQSKE----KES-ADNSKQRQSQEQARRKKMSRAQDCILKYMLKMMEVCKAQGFVYGIIPEKGKPVSGASDNLRAWW 140
                     79*******************98....666.8999************************************************************ PP

                     XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX....XX----STTS-HHHHHHHHHHHSSSSSS-TTS--TTT--HHHH---S--HHHHHHT--TT- CS
            EIN3  96 kekvefdrngpaaiskyqaknlilsgesslqtersseshslselqDTtlgSLLsalmqhcdppqrrfplekgvepPWWPtGkelwwgelglskdq 190
                     kekv+fdrngpaai+kyqa+++i++++++++t  +s++h+l+elqDTtlgSLLsalmqhcdppqrrfplekgv+pPWWPtG+e+ww+elgl++dq
  XP_023550422.1 141 KEKVRFDRNGPAAIAKYQAEHAIPGNNDDCNT-VTSTPHTLQELQDTTLGSLLSALMQHCDPPQRRFPLEKGVSPPWWPTGNEEWWPELGLPNDQ 234
                     ******************************99.9************************************************************* PP

                     -.-----GGG--HHHHHHHHHHHHHHTGGGHHHHHHTTTTSSSSTTT--SHHHHHHHHHHTTTTT-S--XXXX..XXXXXXXXXXXXXXXXXXXX CS
            EIN3 191 gtppykkphdlkkawkvsvLtavikhmsptieeirelerqskylqdkmsakesfallsvlnqeekecatvsah..ssslrkqspkvtlsceqked 283
                     g+ppykkphdlkkawkvsvLtavikhmsp+i++ir+l+rqsk+lqdkm+akes+++l+++nqee+++++++++  ++    +s ++ +s+ +++d
  XP_023550422.1 235 GPPPYKKPHDLKKAWKVSVLTAVIKHMSPDIAKIRKLVRQSKCLQDKMTAKESATWLAIVNQEEALARKLYPDkcPPMPICGSGSLLISDTSDYD 329
                     *************************************************************************76555556************** PP

                     XX.XXXXXX.XXXXXXXXXX...............................XXXXXXXXXXXXXXXXXXXXX......XXXXXXX.XXXXXXXXX CS
            EIN3 284 ve.gkkeskikhvqavktta...............................gfpvvrkrkkkpsesakvsskevsrtcqssqfrgsetelifadk 346
                     ve +++e++  ++ + k ++                               + ++ rkrk+ ++es ++++++  +tc+ sq+++++++l+f d+
  XP_023550422.1 330 VEgVEDEPN-VQAGESKPHDlnffnmgapgprerlvmppvgtqikeefmenNSDLSRKRKQLTDESITIMNQK-LYTCEYSQCPYNSQRLGFFDR 422
                     **6667777.456666666699**************************************9999998888886.6******************** PP

                     XXXXXXX CS
            EIN3 347 nsisqne 353
                     +s+++++
  XP_023550422.1 423 TSRNNHQ 429
                     *****98 PP

Sequence
Protein Sequence    Length: 618 aa
MMNNMGIFED ISFCQNLEYF SAPPGEQETA REHEAEATLE EDYSDEELDV DELERRMWRD  60
RMLLRRLKEQ SKEKESADNS KQRQSQEQAR RKKMSRAQDC ILKYMLKMME VCKAQGFVYG  120
IIPEKGKPVS GASDNLRAWW KEKVRFDRNG PAAIAKYQAE HAIPGNNDDC NTVTSTPHTL  180
QELQDTTLGS LLSALMQHCD PPQRRFPLEK GVSPPWWPTG NEEWWPELGL PNDQGPPPYK  240
KPHDLKKAWK VSVLTAVIKH MSPDIAKIRK LVRQSKCLQD KMTAKESATW LAIVNQEEAL  300
ARKLYPDKCP PMPICGSGSL LISDTSDYDV EGVEDEPNVQ AGESKPHDLN FFNMGAPGPR  360
ERLVMPPVGT QIKEEFMENN SDLSRKRKQL TDESITIMNQ KLYTCEYSQC PYNSQRLGFF  420
DRTSRNNHQL NCPFRHDSSH IFSMPSFQTN GDKSSSPVPP SLNHSKPPPI RSMNPTPPFR  480
VSGLGLPEDD QKMISDLLSC YDSNLQQDKN TLNPSNVDAR GDNNPNQQLP KFQPQVDINM  540
YGQAAIVGNN MPIQHPDISS TKLPFEEYKA AAFDSPFSMY PNENIPDLRF GSPFNLASID  600
YAAADTPLPK QDTPLWYL
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    384    389  SRKRKQ
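The NLS coordinates are 1-based and inclusive, so they can be checked directly against the protein sequence listed in this record. The following is a minimal Python sketch, assuming the numbered sequence block is pasted in as plain text; the helper names `parse_numbered_block` and `slice_1based` are hypothetical and are not part of any PlantTFDB tooling.

```python
# Hypothetical helpers for illustration only; not a PlantTFDB API.

# Sequence block copied from this record, including the group spacing
# and the trailing position counters.
RAW_BLOCK = """
MMNNMGIFED ISFCQNLEYF SAPPGEQETA REHEAEATLE EDYSDEELDV DELERRMWRD  60
RMLLRRLKEQ SKEKESADNS KQRQSQEQAR RKKMSRAQDC ILKYMLKMME VCKAQGFVYG  120
IIPEKGKPVS GASDNLRAWW KEKVRFDRNG PAAIAKYQAE HAIPGNNDDC NTVTSTPHTL  180
QELQDTTLGS LLSALMQHCD PPQRRFPLEK GVSPPWWPTG NEEWWPELGL PNDQGPPPYK  240
KPHDLKKAWK VSVLTAVIKH MSPDIAKIRK LVRQSKCLQD KMTAKESATW LAIVNQEEAL  300
ARKLYPDKCP PMPICGSGSL LISDTSDYDV EGVEDEPNVQ AGESKPHDLN FFNMGAPGPR  360
ERLVMPPVGT QIKEEFMENN SDLSRKRKQL TDESITIMNQ KLYTCEYSQC PYNSQRLGFF  420
DRTSRNNHQL NCPFRHDSSH IFSMPSFQTN GDKSSSPVPP SLNHSKPPPI RSMNPTPPFR  480
VSGLGLPEDD QKMISDLLSC YDSNLQQDKN TLNPSNVDAR GDNNPNQQLP KFQPQVDINM  540
YGQAAIVGNN MPIQHPDISS TKLPFEEYKA AAFDSPFSMY PNENIPDLRF GSPFNLASID  600
YAAADTPLPK QDTPLWYL
"""


def parse_numbered_block(text: str) -> str:
    """Join a numbered, space-grouped sequence block into one string."""
    # Keeping letters only drops the group spacing, the trailing
    # position counters (60, 120, ...) and the line breaks.
    return "".join(ch for ch in text if ch.isalpha())


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


protein = parse_numbered_block(RAW_BLOCK)
nls = slice_1based(protein, 384, 389)
print(len(protein), nls)
```

Run as written, this prints `618 SRKRKQ`, which agrees with the stated protein length and with the NLS row. The same slicing helper applies to the signature domain coordinates (51 to 429).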
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G27050.1  0.0      EIL family protein