PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_024973021.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Carduoideae; Cardueae; Carduinae; Cynara; Cynara cardunculus; Cynara cardunculus subsp. cardunculus
Family Trihelix
Protein Properties Length: 450aa    MW: 52215.3 Da    PI: 6.7047
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_024973021.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix74.71.5e-23132229186
        trihelix   1 rWtkqevlaLiearremeerlrrgk...........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcpyfd 83 
                     +W++ +v++Li+a+++++e+   +            +kk++W++vsk+m+erg  +sp+qC++k+++lnkryk+++e+ +++ ++e++++ +++d
  XP_024973021.1 132 KWSDPMVRLLITAVSYIGEDAAMEYgggsrrkysnlQKKGKWKSVSKVMAERGHFVSPQQCEDKFNDLNKRYKRLNEILGRGtSCEVVENPSLLD 226
                     7**************888777753222345566777**********************************************669******9999 PP

        trihelix  84 qle 86 
                      ++
  XP_024973021.1 227 LMD 229
                     997 PP

Sequence ? help Back to Top
Protein Sequence    Length: 450 aa     Download sequence    
MEGNLSSGNM ISGQTGSSYG GLDLQGSMRA HHQQHNNPFT LNQHQHHHHL QSSRQQGSMV  60
HPSIHENLPL RMGMMQDCDR HNPTISLADF GKGERGKSCV SDEDEPSFAE DGHENRNDDN  120
RGKNMSPWQR VKWSDPMVRL LITAVSYIGE DAAMEYGGGS RRKYSNLQKK GKWKSVSKVM  180
AERGHFVSPQ QCEDKFNDLN KRYKRLNEIL GRGTSCEVVE NPSLLDLMDH VSDKAKEEVR  240
KILSSKHLYY EEMCSYHNGN RLHLPPDPEL QRSLRLALRA RDDHESNDGR RGSHEDLDED  300
DQDPYMDDHD DYEDHHHGLQ LDQRAVYGLG EGSTKRVKQC EGHDLVCNKN FGSQPKNVQA  360
DMNQALPEGV KANLLQDQWM KHRLVQLEEQ KLHIQAQMLE LEKERFKWQR FRRKKDRELE  420
MMKMENERMK LENEQMALEL KRKEMCADLN
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1403415KERFKWQRFRRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21200.11e-138Trihelix family protein