PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_024973022.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Carduoideae; Cardueae; Carduinae; Cynara; Cynara cardunculus; Cynara cardunculus subsp. cardunculus
Family Trihelix
Protein Properties Length: 423aa    MW: 49585.5 Da    PI: 6.845
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_024973022.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix74.91.4e-23105202186
        trihelix   1 rWtkqevlaLiearremeerlrrgk...........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcpyfd 83 
                     +W++ +v++Li+a+++++e+   +            +kk++W++vsk+m+erg  +sp+qC++k+++lnkryk+++e+ +++ ++e++++ +++d
  XP_024973022.1 105 KWSDPMVRLLITAVSYIGEDAAMEYgggsrrkysnlQKKGKWKSVSKVMAERGHFVSPQQCEDKFNDLNKRYKRLNEILGRGtSCEVVENPSLLD 199
                     7**************888777753222345566777**********************************************669******9999 PP

        trihelix  84 qle 86 
                      ++
  XP_024973022.1 200 LMD 202
                     997 PP

Sequence ? help Back to Top
Protein Sequence    Length: 423 aa     Download sequence    
MRAHHQQHNN PFTLNQHQHH HHLQSSRQQG SMVHPSIHEN LPLRMGMMQD CDRHNPTISL  60
ADFGKGERGK SCVSDEDEPS FAEDGHENRN DDNRGKNMSP WQRVKWSDPM VRLLITAVSY  120
IGEDAAMEYG GGSRRKYSNL QKKGKWKSVS KVMAERGHFV SPQQCEDKFN DLNKRYKRLN  180
EILGRGTSCE VVENPSLLDL MDHVSDKAKE EVRKILSSKH LYYEEMCSYH NGNRLHLPPD  240
PELQRSLRLA LRARDDHESN DGRRGSHEDL DEDDQDPYMD DHDDYEDHHH GLQLDQRAVY  300
GLGEGSTKRV KQCEGHDLVC NKNFGSQPKN VQADMNQALP EGVKANLLQD QWMKHRLVQL  360
EEQKLHIQAQ MLELEKERFK WQRFRRKKDR ELEMMKMENE RMKLENEQMA LELKRKEMCA  420
DLN
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1376388KERFKWQRFRRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21200.11e-130Trihelix family protein