PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc023507.1_g020.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family HB-PHD
Protein Properties Length: 738aa    MW: 82476.2 Da    PI: 6.2341
Description HB-PHD family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc023507.1_g020.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox331e-105115541053
                            HHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHH CS
               Homeobox  10 eqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRa 53 
                            + +e L  +F+ n+ ps++ +eeL+k+lgL+ ++V  WF+N R+
  Cse_sc023507.1_g020.1 511 DAVEKLRIVFATNQLPSRALKEELSKQLGLDPEKVNKWFKNARY 554
                            67889999**********************************97 PP

2PHD29.31.9e-10213266250
                            BTTTSS..TCTTSSEEEBS.SSSSEEETTTSTSSSSHHSHHSS..TBSSHHHHT CS
                    PHD   2 CkvCgk..sdeegelvlCd.gCkewfHlkClglkleseekpeg..ewlCeeCke 50 
                            C+ C+   +  +++++lCd +C+ +fH+ C++++l +e++p g   w+C+ C +
  Cse_sc023507.1_g020.1 213 CAKCKLreAFPDNDIILCDgTCNCAFHQMCIDPPLLTENIPPGdqGWFCKYCVC 266
                            88898844455********56********************9999*******87 PP

Sequence ? help Back to Top
Protein Sequence    Length: 738 aa     Download sequence    
MSGEGAEEAH KNIGELLHSK SLTKRSIKRK SHGNTQSTPV SKKKVVDYLT NSKRNDFKHI  60
NASRKPAKSM SMTSLEGDQN QAATGSVENA NLEKENFTSE NIKRRRKRKQ KRANNKVEID  120
EAAKLKRRTR YLLIKMRIEQ NLIDAYSTEG WKGQSREKIK PERELQRAKK QILQCKLGIR  180
DALHQLDLLS SDGSIDESVI GPDGTVHHDH IHCAKCKLRE AFPDNDIILC DGTCNCAFHQ  240
MCIDPPLLTE NIPPGDQGWF CKYCVCKTEI IDAMNAHLGT SYTNDSNWQE IFKVEATLPD  300
AGDTILNQED WPSDESGDDD YDPERVEKRE NSCSNGQVCS DGESSNDDAT SNYSLQSLED  360
EDLDDESLKL SITMGLESTS TDVLGQGSGS GSDGDFMSGR RQRQSVDYRK LYDEMFGKDA  420
LANEQVSEDE DWGPTNRKRR EKESDAASTL MTLCETEKSV KDTLCETEKS VKGVADTSTE  480
KIVKDVADTS TADTNLSCKG TKRSFFRIPA DAVEKLRIVF ATNQLPSRAL KEELSKQLGL  540
DPEKVNKWFK NARYLSLKTQ KLAEDIPTQM DDPSISKECG SEPTKNESID KTIPVNLPIN  600
VGHTPNDRRP KKFRRRKNAS SATSTAMQQD DGDLHLSTLA NKVDGGVDVV EDDASLKMLE  660
DNANKEKNKA VVVDGNQQTA TAAEAHMEKL CFLKTKLEKL QQVLLLRTPN RKAKTTASSS  720
IDHNDIIFFP VAHLKEKS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
12832KRKSH
2103109KRRRKRK
3103110KRRRKRKQ
4104112RRRKRKQKR
5437442RKRREK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G29940.21e-153HB-PHD family protein