PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc010116.1_g100.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family EIL
Protein Properties Length: 546aa    MW: 62381.2 Da    PI: 6.1136
Description EIL family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc010116.1_g100.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1EIN34849e-148424041354
                            XXXXXXXXXXXXXXXXXXXXXXX..XXXXX.XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX CS
                   EIN3   1 eelkkrmwkdqmllkrlkerkkqlledkeaatgakksnksneqarrkkmsraQDgiLkYMlkemevcnaqGfvYgiipekgkpvegas 88 
                            +el++rmw+d+m+lk+lke + q   +k++ ++++k+++s+eqarrkkmsraQDgiLkYMlk+mevc+aqGfvYgiipekgkpv+gas
  Cse_sc010116.1_g100.1  42 DELERRMWRDKMKLKKLKESSCQ---AKQN-VNTAKQQQSQEQARRKKMSRAQDGILKYMLKMMEVCKAQGFVYGIIPEKGKPVSGAS 125
                            79*************99999887...3555.889999*************************************************** PP

                            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX....XX----STTS-HHHHHHHHHHHSSSSSS-TTS--TTT--HHHH---S CS
                   EIN3  89 dsLraWWkekvefdrngpaaiskyqaknlilsgesslqtersseshslselqDTtlgSLLsalmqhcdppqrrfplekgvepPWWPtG 176
                            d+Lr+WWk+kv+fdrngpaa++k qa n i+++++s ++    ++++l+elqDTtlgSLLsalmqhcdppqrrfplekgv+pPWWPtG
  Cse_sc010116.1_g100.1 126 DNLREWWKDKVRFDRNGPAAVAKFQAINSIPGENESANS-IGPTPQTLQELQDTTLGSLLSALMQHCDPPQRRFPLEKGVPPPWWPTG 212
                            ********************************9999977.9*********************************************** PP

                            --HHHHHHT--TT--..-----GGG--HHHHHHHHHHHHHHTGGGHHHHHHTTTTSSSSTTT--SHHHHHHHHHHTTTTT-S--XXXX CS
                   EIN3 177 kelwwgelglskdqg.tppykkphdlkkawkvsvLtavikhmsptieeirelerqskylqdkmsakesfallsvlnqeekecatvsah 263
                            +e+ww++lgl+kdq  +ppykkphdlkkawkv+vL+avikhm p++++ir+l+rqsk+lqdkm+akes+++l+v+nqee+++++++++
  Cse_sc010116.1_g100.1 213 NEEWWPQLGLQKDQArPPPYKKPHDLKKAWKVGVLSAVIKHMFPDVAKIRKLVRQSKCLQDKMTAKESATWLAVINQEESLARELYPD 300
                            **************99************************************************************************ PP

                            ..XX..XXXXXXXXXXXXXXXXXXXX...XXXXXX.XXXXXXXXXX.........XXXXXXXXXXXXXXXXXXXXX......XXXXXX CS
                   EIN3 264 ..ss..slrkqspkvtlsceqkedve...gkkeskikhvqavktta.........gfpvvrkrkkkpsesakvsskevsrtcqssqfr 335
                              ++  s+  +    ++++  ++dve    + e++  + q  k ++         +f++ rkrk   ++++ v  k  ++tc+  +++
  Cse_sc010116.1_g100.1 301 rcPPfmSSDCSGSFAIYDDAGEYDVEggqHEPEPN-FDLQDIKPNNlglvnyqaiDFTTDRKRKTSLTNNEVVDHK--IYTCEFLKCP 385
                            8644533333555666788899****853333333.6888999888******************999999998877..7********* PP

                            X.XXXXXXXXXXXXXXXXX CS
                   EIN3 336 gsetelifadknsisqney 354
                            +++  ++f d++ ++++++
  Cse_sc010116.1_g100.1 386 YNQHFHGFLDRSCRDNHQM 404
                            *****************96 PP

Sequence ? help Back to Top
Protein Sequence    Length: 546 aa     Download sequence    
MGFCEQLNVF CAQSAQGDIA CILNDTPTAV DDDFSDEEID VDELERRMWR DKMKLKKLKE  60
SSCQAKQNVN TAKQQQSQEQ ARRKKMSRAQ DGILKYMLKM MEVCKAQGFV YGIIPEKGKP  120
VSGASDNLRE WWKDKVRFDR NGPAAVAKFQ AINSIPGENE SANSIGPTPQ TLQELQDTTL  180
GSLLSALMQH CDPPQRRFPL EKGVPPPWWP TGNEEWWPQL GLQKDQARPP PYKKPHDLKK  240
AWKVGVLSAV IKHMFPDVAK IRKLVRQSKC LQDKMTAKES ATWLAVINQE ESLARELYPD  300
RCPPFMSSDC SGSFAIYDDA GEYDVEGGQH EPEPNFDLQD IKPNNLGLVN YQAIDFTTDR  360
KRKTSLTNNE VVDHKIYTCE FLKCPYNQHF HGFLDRSCRD NHQMNCPYRS IPTVGFRQSY  420
IQQPKQAVGP IMPEPTSPFD LSVLGVPEDG QKMINDLMSF YDTNVQENNN KPDPRNLSAS  480
NDSFLGQSSI QFQENNFLNH GFESNISLSQ QTHDFNHIQQ EGFQMMFGSH FNRDYTQPKQ  540
DGSIWF
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G20770.10.0EIL family protein