PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc020932.1_g020.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family Trihelix
Protein Properties Length: 417aa    MW: 47521.2 Da    PI: 6.4137
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc020932.1_g020.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix74.91.4e-2377191186
               trihelix   1 rWtkqevlaLiearremeerlrrgk............................lkkplWeevskkmrergferspkqCkekwenlnkr 60 
                            +Wt+++v++Li ++ ++++++++++                            +kk++W++vsk+m+ergf +sp+qC++k+++lnkr
  Cse_sc020932.1_g020.1  77 KWTDNMVRLLIMVVFYIGDEVGSDQppagggdmvvkkksggggggggnghgvlQKKGKWKSVSKAMMERGFYVSPQQCEDKFNDLNKR 164
                            7*********************998889999999999*************************************************** PP

               trihelix  61 ykkikegekkr.tsessstcpyfdqle 86 
                            yk+++++ +k+ +++++++  ++++++
  Cse_sc020932.1_g020.1 165 YKRVNDILGKGiACKVVENQTLLESMD 191
                            ***********89***********998 PP

Sequence ? help Back to Top
Protein Sequence    Length: 417 aa     Download sequence    
METNGIFSNS LGLEISLNHQ QNPNFILHHQ QPPYTTAPAK QYEEDEPHSN NTNNPTTTED  60
SGGDGKLRKG SPWQRMKWTD NMVRLLIMVV FYIGDEVGSD QPPAGGGDMV VKKKSGGGGG  120
GGGNGHGVLQ KKGKWKSVSK AMMERGFYVS PQQCEDKFND LNKRYKRVND ILGKGIACKV  180
VENQTLLESM DHIPKKLKEE VKKLLNSKHL FFREMCAYHN SCGGGATGVA SGVAHHSSPP  240
EVTTTTNQRQ ICVHSNHDDV DDDNDEGDDD VDEDEDDCEH FEEEDDGLSS RKRGRTMEND  300
VMHQFDGEVS VVMQDITKSV WEKRQWMKGR MMQLEEQRVG YQGQSYELKK QRLKWLKFCY  360
KKEREMEVEK LRNERVKLEN ERMVLLLKQK EMELVDLQHQ YASRVHKRTS DPSSING
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1163198KRYKRVNDILGKGIACKVVENQTLLESMDHIPKKLK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G10040.13e-95Trihelix family protein