PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc009449.1_g020.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family HD-ZIP
Protein Properties Length: 1049aa    MW: 115671 Da    PI: 6.2766
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc009449.1_g020.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox58.11.5e-182078357
                           --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
               Homeobox  3 kRttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57
                           k  ++t+eq+e+Le+l++ +++ps  +r++L +++    +++ +q+kvWFqNrR +ek+
  Cse_sc009449.1_g020.1 20 KYVRYTPEQVEALERLYHDCPKPSSLRRQQLIRECpilsNIEPKQIKVWFQNRRCREKQ 78
                           5679*****************************************************97 PP

2START576.1e-19146241296
                            HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-SEEEEEEEECTT. CS
                  START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetlakaetlevissg. 88 
                            +aee+++e+++ka+ ++  Wv+++ +++g++++ +++ s++++g a+ra+g+v  +++  v+e+l+d++ W ++++++++l+v+++g 
  Cse_sc009449.1_g020.1 146 IAEETLTEFLSKATGTAVEWVQMPGMKPGPDSIGIIAISHGCTGVASRACGLVGLEPT-RVAEILKDRPSWYRDCRAVDVLNVLTTGt 232
                            789*******************************************************.8888888888******************9 PP

                            .EEEEEEEE CS
                  START  89 .galqlmva 96 
                             g+++l ++
  Cse_sc009449.1_g020.1 233 nGTIELLYM 241
                            977666554 PP

3START58.81.8e-19254337125205
                            EEEEEEE-TTS--....-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
                  START 125 ivdvSvdseqkppe...sssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                            +++ S++++q+ p+    +++vRa++ pSg+li+p+++g+s +++v+h+d ++ +++++lr+l++s++  +++t++a+++++++
  Cse_sc009449.1_g020.1 254 VCESSLNNTQNGPSmppVPHFVRAKMMPSGYLIRPCDGGGSIIHIVDHIDFESGSIPEVLRPLYESSTLLAQRTTLAAFRQLRQ 337
                            799999999999988889*************************************************************99876 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1049 aa     Download sequence    
MMAVSSSCKG GDSLGMDNGK YVRYTPEQVE ALERLYHDCP KPSSLRRQQL IRECPILSNI  60
EPKQIKVWFQ NRRCREKQRK EASRLQAVNR KLSAMNKLLM EENDRLQKQV SNLVYENGYF  120
LVTSGQHHLS PQHPPRDASP AGLLSIAEET LTEFLSKATG TAVEWVQMPG MKPGPDSIGI  180
IAISHGCTGV ASRACGLVGL EPTRVAEILK DRPSWYRDCR AVDVLNVLTT GTNGTIELLY  240
MQKRCDFDFV KIKVCESSLN NTQNGPSMPP VPHFVRAKMM PSGYLIRPCD GGGSIIHIVD  300
HIDFESGSIP EVLRPLYESS TLLAQRTTLA AFRQLRQISQ EITQPMVTSW GRRPAALRAL  360
GQRMSRGFNE AINGFTDEGW TMMESDGIDD VTVLVNSSPD KVMGAAPMYA DGFPSSINNA  420
VLCAKASMLL QNVPPAILTR FLREHRSEWA DSSIDGYSAA SVKAGPCGLP LARNGSFGGQ  480
FMEVIKLENM AHYRPDDMLM PADIFFLQVS SGHSVFCWLS SCDLDFEKSI NLIYFVSQLC  540
SGVDENAIGT SAELIFAPID ASFTDDAPLL PSGFRIIPIN NVTNSQNPTR DLASTLEAGP  600
PGKRTAADYL GQSGPTKSVM TIAFQFAYEI HLQENIAAMA RQYVRSIIAS VQRVALALSP  660
SPFGARSLQA PSGTPEAHLL TRWICQSFRF FLGEELFKNV DERSDSMLKT LWHHSDAIMC  720
CSLKAVPDFT FANQAGLDML ETTLVSLQDI TLDKIFEGGG RTNNVCSELP QILQQGFACL  780
PGGVCLTSMG RPVSYERAVA WKVLNDEENP HCIAFVAPGP STRTVYELLL PSGQRDGVSF  840
LSEPSSPNRC ADTLDATNPQ IVIDLRNSSG TCSLKRKGSF GVESVSSTID EKRVKKSIDE  900
CEPSTAQLNF NSICLDSMLS QHQVPDVPTF TGNRCPQIDP GFTETRRTTM AKGGRVKVFI  960
PQEYRSFIHH LFLDKHFLEN SRAYNQMFTM TSLGANIDES LTDEPAEKKQ TPKDVGSTAV  1020
RKQLFESSAA EEQAPEAKKP KKDDEPPQE
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G32880.10.0HD-ZIP family protein