PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc002402.1_g010.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family HD-ZIP
Protein Properties Length: 848aa    MW: 93250.2 Da    PI: 6.0061
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc002402.1_g010.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox57.52.3e-183088357
                           --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
               Homeobox  3 kRttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57
                           k  ++t+eq+e+Le++++++++ps  +r++L +++    +++ +q+kvWFqNrR +ek+
  Cse_sc002402.1_g010.1 30 KYVRYTTEQVEALERVYAECPKPSSLRRQQLIRECpilsNIEPKQIKVWFQNRRCREKQ 88
                           5679*****************************************************97 PP

2bZIP_120.21.3e-06821241860
                            HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH CS
                 bZIP_1  18 rrsRqRKkaeieeLeekvkeLeaeNkaLkkeleelkkevaklk 60 
                            rr+R++ ++e  +L++  k+L+a Nk L +e+++l+k+v++l 
  Cse_sc002402.1_g010.1  82 RRCREKQRKESSRLQTVNKKLSAMNKLLMEENDRLQKQVSQLV 124
                            9***************************************985 PP

3START159.62.4e-501683722201
                            HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-SEEEEEEEECTT. CS
                  START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetlakaetlevissg. 88 
                            +aee+++e+++ka+ ++  Wv+++ +++g++++ +f+ s+++sg a+ra+g+v  ++++ ve+l d++  W +++++ e+      g 
  Cse_sc002402.1_g010.1 168 IAEETMAEFLSKATGTAVDWVQMPGMKPGPDSVGIFAISQSCSGVAARACGLVSLEPTKIVEILKDRP-SWYRDCRSLEVFTMFPAGn 254
                            799*******************************************************9999999999.***********99999999 PP

                            .EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--....-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE CS
                  START  89 .galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe...sssvvRaellpSgiliepksnghskvtwvehvd 171
                             g+++l++++  a+++l+p Rdf+++Ry+ +l+ g++v++++S++     p+     ++vR+e+lpSg+li+p+++g+s +++v+h +
  Cse_sc002402.1_g010.1 255 gGTIELIYTQIFAPTTLAPaRDFWTLRYTTSLENGSLVVCERSLTGSGAGPNpasATQFVRGEMLPSGYLIRPCDGGGSIIHIVDHLN 342
                            9********************************************9988888788899****************************** PP

                            --SSXXHHHHHHHHHHHHHHHHHHHHHHTX CS
                  START 172 lkgrlphwllrslvksglaegaktwvatlq 201
                            l++++++++lr+l++s+ + ++k++ a+l+
  Cse_sc002402.1_g010.1 343 LEAWSVPEVLRPLYESSKVVAQKMTIAALR 372
                            *************************99986 PP

Sequence ? help Back to Top
Protein Sequence    Length: 848 aa     Download sequence    
MAMVVQHQQQ QHKEISGNNS IDKHQLDNGK YVRYTTEQVE ALERVYAECP KPSSLRRQQL  60
IRECPILSNI EPKQIKVWFQ NRRCREKQRK ESSRLQTVNK KLSAMNKLLM EENDRLQKQV  120
SQLVCENGFM RQQLHTGSSA NDASCESVIT TPQHSLRDAN NPAGLLSIAE ETMAEFLSKA  180
TGTAVDWVQM PGMKPGPDSV GIFAISQSCS GVAARACGLV SLEPTKIVEI LKDRPSWYRD  240
CRSLEVFTMF PAGNGGTIEL IYTQIFAPTT LAPARDFWTL RYTTSLENGS LVVCERSLTG  300
SGAGPNPASA TQFVRGEMLP SGYLIRPCDG GGSIIHIVDH LNLEAWSVPE VLRPLYESSK  360
VVAQKMTIAA LRFIRQIAQE SSGEVVYGLG RQPAVLRTLS QRLSRGFNDA VNGFSDDGWT  420
VMDSDGVEDV IIAVNSSKNL SNSMNPSNSL AYLGGVLCAK ASMLFQDVPP AALVRFLREH  480
RSEWADFNVD AYSAASVKAN PYSYPGMRPT RFTGSQIIMP LGHTIEHEEM LEVVRLEGHA  540
LGQEDPFMSR DIHLLQLCNG IDEHAVGACS ELVFAPIDEM FPDDAPLIPS GFRIIPLDPK  600
SSDGKNALVT THRTLDLTSS LDVSPASNHG STDMSACTST RSVLTIAFQF PFENNLAESV  660
ATMARQYVRS VINKVQRVAM AVSPSNLIPS VDPKPSPGSP EALTLAQWIC QSYTYHLGAD  720
LLSTGSVVGD SLLKDLWQHQ DAILCCSLKS LPVFIFANQA GLDMLETTLV SLQDISLDKM  780
FDDDGRKALV PEFAKIMQQG YSHLPGGICM SAMGRHITYE QAIAWKVLAA DESTVHCLAF  840
SFVNWSFV
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G60690.10.0HD-ZIP family protein