PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc014659.1_g020.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family HD-ZIP
Protein Properties Length: 504aa    MW: 58467.2 Da    PI: 8.7024
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc014659.1_g020.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox55.97.2e-1895149256
                            T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
               Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                            rk+ ++t eq++ Le+ F+ ++++++ +++eLAkkl+L  rq+ vWFqNrRa+ k
  Cse_sc014659.1_g020.1  95 RKKLKLTMEQTTLLEDRFKIHSTLNTGQKQELAKKLNLLPRQIEVWFQNRRARTK 149
                            788899***********************************************98 PP

2Homeobox58.31.3e-18347401256
                            T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
               Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                            rk+ ++t+eq++ Le+ F+ ++++++ +++eLAkkl+L  rq+ vWFqNrRa+ k
  Cse_sc014659.1_g020.1 347 RKKLKLTTEQITLLEDSFKIHSTLNTGQKQELAKKLHLLPRQIEVWFQNRRARTK 401
                            788899***********************************************98 PP

3HD-ZIP_I/II106.52.1e-3495182188
            HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLr 88 
                            +kk++l+ eq +lLE+ F+ +++L++ +K+ela++L+l prq++vWFqnrRARtk+k++E+dye Lk+++++lk+en+rL+ke +e+r
  Cse_sc014659.1_g020.1  95 RKKLKLTMEQTTLLEDRFKIHSTLNTGQKQELAKKLNLLPRQIEVWFQNRRARTKLKHIEQDYEWLKKCFETLKDENSRLKKELQEVR 182
                            69*********************************************************************************99976 PP

4HD-ZIP_I/II109.22.8e-35347434188
            HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLr 88 
                            +kk++l++eq++lLE+sF+ +++L++ +K+ela++L+l prq++vWFqnrRARtk+kq+E++++ Lk+++++l+een+rL+ke +e r
  Cse_sc014659.1_g020.1 347 RKKLKLTTEQITLLEDSFKIHSTLNTGQKQELAKKLHLLPRQIEVWFQNRRARTKLKQIEQECSLLKKCCETLSEENRRLKKELREAR 434
                            69*********************************************************************************98766 PP

Sequence ? help Back to Top
Protein Sequence    Length: 504 aa     Download sequence    
MNEEERCNTT LGLGIGVGIK KEKQSKQTKK SLWLDLSLPL HPKIEPSDHD QKNDEGTDDQ  60
DSLSSKTIDD QLEEERGIKR SVSNTNLYNN NDGSRKKLKL TMEQTTLLED RFKIHSTLNT  120
GQKQELAKKL NLLPRQIEVW FQNRRARTKL KHIEQDYEWL KKCFETLKDE NSRLKKELQE  180
VRCFSLKVDN HQPLPQLPLP FYIRYPTTTM MHNPCDKNGK SREITKVAVV NGGNMGHKLH  240
DDVCKIMSKE ERCNTTLGLG IEVEVKREKQ LKQKKRGFWL DLALPIHPKV ELTDHGDYEH  300
KHDEEKDDQD SCSSKTIDDL EEKEERGDKR SDSSNFDVYH DNSGGSRKKL KLTTEQITLL  360
EDSFKIHSTL NTGQKQELAK KLHLLPRQIE VWFQNRRART KLKQIEQECS LLKKCCETLS  420
EENRRLKKEL REARSSSKLD NHQQQPQLPP SFYIRYSTKT ETRHPCDNIG KIGEDMKPVA  480
AVNGSIVVHE QDNSNKKNAT ATLL
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1396404RRARTKLKQ
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G37790.18e-38HD-ZIP family protein