PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG70478.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family EIL
Protein Properties Length: 832aa    MW: 89261.1 Da    PI: 6.872
Description EIL family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG70478.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1EIN3425.36.1e-130414801352
                 XXXXXXXXXXXXXXXXXXXXXXX..XXXXX.XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX CS
        EIN3   1 eelkkrmwkdqmllkrlkerkkqlledkeaatgakksnksneqarrkkmsraQDgiLkYMlkemevcnaqGfvYgiipekgkpvegasdsLraWWkekv 99 
                 +el++rmw+d+ +l+r+ke +k     ke   ++ ++++s+eqarrkkmsra DgiLkYMlk+mevc+aqGfvYgiipekgkpv g+sd+LraWWk+kv
  GBG70478.1  41 DELERRMWRDRLKLRRIKELQKA----KEL-SDKPRQKQSQEQARRKKMSRAHDGILKYMLKMMEVCKAQGFVYGIIPEKGKPVGGSSDNLRAWWKDKV 134
                 79*************99998886....566.7888999************************************************************* PP

                 XXXXXXXXXXXXXXXXXXXXXXXXXX....XX----STTS-HHHHHHHHHHHSSSSSS-TTS--TTT--HHHH---S--HHHHHHT--TT--.-----G CS
        EIN3 100 efdrngpaaiskyqaknlilsgesslqtersseshslselqDTtlgSLLsalmqhcdppqrrfplekgvepPWWPtGkelwwgelglskdqgtppykkp 198
                 +fdrngp a +ky+a++ + + +++ +     ++ +l+elqDTtlgSLLsalmqhcdp qrrfplekg++pPWWPtG+e+ww+ +gl+k qg+ppykkp
  GBG70478.1 135 RFDRNGPLAAAKYAAEHCLNK-SEGVEC-IAPTPRTLQELQDTTLGSLLSALMQHCDPKQRRFPLEKGIPPPWWPTGNEEWWPLVGLPKGQGAPPYKKP 231
                 *************87777655.555555.899******************************************************************* PP

                 GG--HHHHHHHHHHHHHHTGGGHHHHHHTTTTSSSSTTT--SHHHHHHHHHHTTTTT-S--XXXXXX.............................XXX CS
        EIN3 199 hdlkkawkvsvLtavikhmsptieeirelerqskylqdkmsakesfallsvlnqeekecatvsahss.............................slr 268
                 hdlkkawkv+vLtavikhmsp+i++ir+l+rqsk+lqdkm+a+es+++l++lnqee++++    + +                              +r
  GBG70478.1 232 HDLKKAWKVGVLTAVIKHMSPDISKIRKLVRQSKCLQDKMTARESATWLAILNQEEALARVARGDAEhaalghmscaagalydgaemkvgalisldPSR 330
                 ********************************************************9987666553345666777778888888888888777665555 PP

                 XXXXXXXXXXXXXXXXX......XXXXXX.XXXXXXXXXX........................................................... CS
        EIN3 269 kqspkvtlsceqkedve......gkkeskikhvqavktta........................................................... 302
                   + +v l+ ++++dve      +  ++  ++       +                                                           
  GBG70478.1 331 CTESAVALQPSSDYDVEvaepfdQ--ATTSSQT------Signdessdyevhdcdvygpgraigagkgsglmtrviespssaksqstgsrsspdspeag 421
                 577778888888888885444221..2221111......123344566778888888888888888888888888888888888888888888888888 PP

                 ............XXXXXXXXXXXXXXXXXXXXX......XXXXXXX.XXXXXXXXXXXXXXX CS
        EIN3 303 ............gfpvvrkrkkkpsesakvsskevsrtcqssqfrgsetelifadknsisqn 352
                             g+p+    kk++++s      + + +c  ++++++e ++ fa ++ ++ +
  GBG70478.1 422 gaenivriadgdGVPTPGNAKKRRADSVPE---HRIFMCPYENCPRHEWRNAFAARELRNLH 480
                 888888888887777777666444444432...33588888888888888888888877766 PP

Sequence ? help Back to Top
Protein Sequence    Length: 832 aa     Download sequence    
MGDGGDEFAC RGGNVDMGDI MGEGDGGLHD EDVSDEDIDI DELERRMWRD RLKLRRIKEL  60
QKAKELSDKP RQKQSQEQAR RKKMSRAHDG ILKYMLKMME VCKAQGFVYG IIPEKGKPVG  120
GSSDNLRAWW KDKVRFDRNG PLAAAKYAAE HCLNKSEGVE CIAPTPRTLQ ELQDTTLGSL  180
LSALMQHCDP KQRRFPLEKG IPPPWWPTGN EEWWPLVGLP KGQGAPPYKK PHDLKKAWKV  240
GVLTAVIKHM SPDISKIRKL VRQSKCLQDK MTARESATWL AILNQEEALA RVARGDAEHA  300
ALGHMSCAAG ALYDGAEMKV GALISLDPSR CTESAVALQP SSDYDVEVAE PFDQATTSSQ  360
TSIGNDESSD YEVHDCDVYG PGRAIGAGKG SGLMTRVIES PSSAKSQSTG SRSSPDSPEA  420
GGAENIVRIA DGDGVPTPGN AKKRRADSVP EHRIFMCPYE NCPRHEWRNA FAARELRNLH  480
QENCPYRPGG PLSASPSNLP ASLTPSQLPR VRPSAVAPVI HTVPAMGGTS FSQSMEGPGD  540
SHVQGHPHPT AGFIQGNMQI GLQIPVIGHP LGMQACNPNS PHDHVMGQVS GGNRLEGGPG  600
HLHDLFAGLY REGLAQGSPG VPPNVAAPMH GGASRAAGSQ SEMEPGMGSM NTSPGHVHVH  660
DDGSQHQTHD DDTMAARRMA VDGTAASEGG EDAPRSYTQG FIPVTDGATP VVNHIQCDSL  720
VPRPGRNTMM MMNGNNHPNG MNGMHGKHAG MVMGYEHGQR GRIPAEFANA PSGSGHGGHG  780
PPHRGGHGGH IPHGPHGGHA FQLSIRHGES PSLTLGPPPG DPFEDLIWYF GA
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G20770.11e-131EIL family protein