PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PCP013534.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family EIL
Protein Properties Length: 2947aa    MW: 326268 Da    PI: 6.6292
Description EIL family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PCP013534.1genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1EIN3424.31.3e-129237727051303
                   XXXXXXXXXXXXXXXXXXXXXXX..XXXXX.XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX CS
         EIN3    1 eelkkrmwkdqmllkrlkerkkqlledkeaatgakksnksneqarrkkmsraQDgiLkYMlkemevcnaqGfvYgiipekgkpvegasdsLraWWk 96  
                   e+l++rmwkd+++lkrlker+k   e+++a ++++k +++++qarrkkmsraQDgiLkYMlk mevc+a+GfvYgiipekgkpv+gasd++raWWk
  PCP013534.1 2377 EDLERRMWKDRIKLKRLKERQKL--EAQQA-AEKQKPKQTTDQARRKKMSRAQDGILKYMLKLMEVCKARGFVYGIIPEKGKPVSGASDNIRAWWK 2469
                   89*****************9996..45665.99*************************************************************** PP

                   XXXXXXXXXXXXXXXXXXXXXXXXXXXXX....XX----STTS-HHHHHHHHHHHSSSSSS-TTS--TTT--HHHH---S--HHHHHHT--TT--. CS
         EIN3   97 ekvefdrngpaaiskyqaknlilsgesslqtersseshslselqDTtlgSLLsalmqhcdppqrrfplekgvepPWWPtGkelwwgelglskdqgt 192 
                   ekv+fd+ngpaai+k++a++l++s+++++  ++ ++++ l++lqD+tlgSLLs+lmqhcdppqr++plekg +pPWWPtG+e+ww +lgl + q  
  PCP013534.1 2470 EKVKFDKNGPAAIAKHEAECLAMSDADNN--RNGNSQSILQDLQDATLGSLLSSLMQHCDPPQRKYPLEKGAPPPWWPTGNEDWWVKLGLVHGQI- 2562
                   ************************99998..68999999****************************************************9999. PP

                   -----GGG--HHHHHHHHHHHHHHTGGGHHHHHHTTTTSSSSTTT--SHHHHHHHHHHTTTTT-S--XXXXXX........XXXXXXXXXXXXXXX CS
         EIN3  193 ppykkphdlkkawkvsvLtavikhmsptieeirelerqskylqdkmsakesfallsvlnqeekecatvsahss........slrkqspkvtlsceq 280 
                   ppykkphdlkk+wkv+vLtavikhmsp+i++ir+++rqsk+lqdkm+akes+++l vl++ee+++ + s+++         ++ +++++  +s+++
  PCP013534.1 2563 PPYKKPHDLKKMWKVGVLTAVIKHMSPDIAKIRRHVRQSKCLQDKMTAKESAIWLGVLSREESLIWQPSSDNGtsgitempQSGHGEKQGAASSNS 2658
                   9*********************************************************************94445677776667799999****** PP

                   XXXXXXXXXXX.X.XXXXXXXXX.......................X CS
         EIN3  281 kedvegkkeskik.hvqavktta.......................g 303 
                   ++dv+g+++   + +++++++++                       +
  PCP013534.1 2659 DYDVDGTDDGVGSvSSSKDDRRNqvmdvdpssnlhnnarnnvqdkeQ 2705
                   *****555544331233333333566777766666555444333221 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2947 aa     Download sequence    
MDPGKSVEDQ FSRLHPCLPL NTRIGIVGGG PSGLSSAYAL VKLGYSNVTV LEKHHTVGGM  60
CESVDIEGKI YDLGGQVLAA NSAPVIFHLA KETGSELDEM DSHKLALIDK SGQYQDIKVA  120
DDYVSVISLT LELQDKAAKS GRIGVHAVSE YASDLTPVYL ERQGFSSVPK SVAYGYTASG  180
YGFVQDMPYA YIHEFTRTSM AGKIRRFKGG YTSFWEKISK SLPMVHCNTE VLEIRRYSDS  240
VGVDVKSCDG EVKSMEFDKI IISGSFPLNS GRIYRSPSHP TEHGSEVMEM GDVEKELFSK  300
VQTIDYYTTV LKVKGIEHMP IGFFYFEEYM SNPATIGNPV AMQKFYADTD ILLFWSYGNS  360
VDITGTTVTE LAIDAAKLIG AEVTEVVLQR RFKYFPHVGS QEMKDGFYEK LESELQGFRN  420
TYYVGGLMAF ELTERNSSYA MGLVCKHFAN DNSVPKFPYA KSLFSSQQRW GGSPKRMTEV  480
PGVEFPNLSS LDGYLKHWGA HGVTQNKTLY TWVNEEGVVV SQRTYAELHD NASCIAQKLL  540
TCKNPVIRPR DRVLLVHVPG LDFVDAFFGC LRANVLPVPV LPPDPLQRGG QALLKIENIA  600
KSCGAVAILS TISYHWAVQA GSVKNMISLI AKKPKSSGRW PNLPWLHTDS WIKNSKNVVV  660
EDFKDEFEPQ PGEVSFLQFT SGSTGDAKGV MITHSALIHN VKLMRRRYNS TSRTVLVSWL  720
PQYHDMGLIG GIFTALVSGG TAVLFSPLTF IRNPLLWLQI MSKYQATHSA GPNFAFELVI  780
RRLESDKNRK YDLSSMKFLM VAAEPVRQKT LKRFVELARH FGLSQEVMAP GYGLAENCVF  840
VSCAYGEGKP IMVDWQGRVC CGYVNPGDED VDIRIVDPES GEELKEAGKE GEIWISSPSA  900
GIGYWGREEL SQNTYRNKLP DYPGRIYTRT GDLGRIIDRK LFITGRIKDL IIVAGRNIYS  960
ADVEKTVESA AEVVRPGCCA VIAVPMEILS MKGISVPDNA DQVGLVVIAE VRDGKPVGKD  1020
VVEQIQARVA EEHGVTVASV KMIRPKTISK TTSGKIQRFE CLQQFTDGTL NVVPEPILIK  1080
KKLMRSFTTG TCKEGITPRP QFVRGSPPPS PKISNNDIVD FLKRLVSEQT GISINKISNT  1140
ESLVSYGIDS IGVVRAAQKL SDFLGVPVGA VDIFTATCIA DLASFSENLV MNSQPQLSTT  1200
PSNVPQLETD TDSAELVMEL PESQHLVIWS FQLLALVYVA FMLSIPAYLS VSAFMKCVSA  1260
THALVEEIPY LDYLILLTFA PLAWILCILS TCVSIAFLGN SFLKPNYALN PEVSIWSVDF  1320
VKWWALYKAH EVASKVLAEH LRGTMFLKYW FEMLGARIGS SVFLDTVDIT DPSLVSIGDG  1380
AVIAEGALIQ SHEVKNGVLS FLPIRIGQNS SVGPYAVVQK GTILGEDLEV MALQKSGGKS  1440
AAEATNLQNG KMLPNVTKET EDGVVYQFIG IYIVGLLGTL SASIVYLVYI WMSQKPLSPQ  1500
EFAFACLFGA FHWMPYTVIA YATMFSNVPL GIIYSSISMA VAYLAYGIVL SFLTAALTRL  1560
ISSNQGKKTT HFRMWLCHRI TIACHHRFAK LLSGTEAFCM YMRLLGAKVG KHCSIRAINP  1620
ISDPKLISLG SGVHLGDFSR IIAGYYSSHG LVSGKVEVHD NSVVGSESLV LPGSILQKDV  1680
ILGALSVAPV NSVLQAGGVY IGSQTPMMIK NTMHSLEDRI EEMDMKYKKI VGNLAANLAA  1740
TTLKVKSRYF HRIGVSGKGT LKIYDNIKGL PDHKIFCPGK SYPVIVRHSN SLSADDDARI  1800
DARGAAIRIL SDESSESSLF DLTLKTGKAF YARTIADFAT WLVCGLPARE EYVKRAPHVR  1860
DAVWTSLRYA NSYVELHYYS NICRLFRFED GQEMYVKFKL RPSDENISEE AGKVEPIGIL  1920
PPDTGAIPRS DNDTRPLLFL AKDFQSRVNA EGVRYIFQLQ VRPVPRDEAA RDIALDCTKP  1980
WSESEFPYID VGEVNINHNL SAEQSEQLDF NPFLRCPEVD VIRASSCSQS ASIDHGRSLI  2040
YEICQHLRNG APLPEAWKIF LEQSDVKVDL SGCPMAASLM KKDAQKVTLE RTLFQALWAT  2100
FAQPLLQTVL PHFLLALVIY APLNWTLHMN NTQKIALHWL FPLFWVSSGL LAGLACVVAK  2160
WLLVGKKKEG ETVHIWSIGV FLDTTWQAFR TLAGSYFAEM TSGSIFFVLW MKLMGSEIEL  2220
DQGAYVDSMG ALLNPEMVEI ERGGCVGREA LLFGHIYEGD EGKVKFGKIS VGEGGFVGSR  2280
AIAMPGVRVE VGGSLSALSL AMKEEIIMSR FFSSSSSSFS VPPSCNFSSS SFSDHCEWHW  2340
MGDVGEIGPD ISSDIEEDLR CDNIAEKDVS DEEIEAEDLE RRMWKDRIKL KRLKERQKLE  2400
AQQAAEKQKP KQTTDQARRK KMSRAQDGIL KYMLKLMEVC KARGFVYGII PEKGKPVSGA  2460
SDNIRAWWKE KVKFDKNGPA AIAKHEAECL AMSDADNNRN GNSQSILQDL QDATLGSLLS  2520
SLMQHCDPPQ RKYPLEKGAP PPWWPTGNED WWVKLGLVHG QIPPYKKPHD LKKMWKVGVL  2580
TAVIKHMSPD IAKIRRHVRQ SKCLQDKMTA KESAIWLGVL SREESLIWQP SSDNGTSGIT  2640
EMPQSGHGEK QGAASSNSDY DVDGTDDGVG SVSSSKDDRR NQVMDVDPSS NLHNNARNNV  2700
QDKEQGEKKL RRKRARVRAS PVERRPAPSH NEHLHVPPRG ALPDINHTDV QMIGLQVHEN  2760
QQENGAITTL RPLENDLDVQ AQLPAPELNY YPGVPSGNVI TTQGMHVGGT PLLYHGVRDA  2820
DMHHEDTFNQ HHGDTFNLYN PSTQYPPSHD QQPPQIVMNE PQIIPADGVH VPVHRNGSEI  2880
AGGDFPNFVQ DTFQSEQDRT VNANFGSPID SLSLDYGLFK SPFNFGIDGT GSLDDLELEE  2940
MMEYFAA
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G73730.11e-156EIL family protein