PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc002049.1_g060.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family NF-X1
Protein Properties Length: 858aa    MW: 96493.4 Da    PI: 8.3817
Description NF-X1 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc002049.1_g060.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-NF-X118.93.4e-06178196120
               zf-NF-X1   1 CGkHkCqklCHeGpCppCpq 20 
                            CG HkC  lCH+GpCp+Cp+
  Cse_sc002049.1_g060.1 178 CG-HKCLLLCHPGPCPSCPK 196
                            **.***************96 PP

2zf-NF-X117.68.1e-06283302120
               zf-NF-X1   1 CGkHkCqklCHeGpCppCpq 20 
                            CGkH+C+k CHeG+C++Cp+
  Cse_sc002049.1_g060.1 283 CGKHSCSKGCHEGECGKCPL 302
                            ******************95 PP

3zf-NF-X116.41.9e-05337356119
               zf-NF-X1   1 CGkHkCqklCHeGpCpp.Cp 19 
                            CG+H+C + CH+G C + C+
  Cse_sc002049.1_g060.1 337 CGIHRCPERCHRGSCVEtCR 356
                            ***************99886 PP

Sequence ? help Back to Top
Protein Sequence    Length: 858 aa     Download sequence    
MKSTVNHHRP PPQPPSDSDS SDSDTTTTST KKPDFSNTIF QAYTHLSNHD SPDLTKIQSF  60
LTSSRAGALS CLICLERIRP TDPTWQCSTR CHALFHLICI QSWARQSSDL SSARALTRGA  120
EDSTATWNCP KCRIEFPKNL VPKKYLCFCG KVEDPVHDPW VLPHSCGEMC LRDLKYDCGH  180
KCLLLCHPGP CPSCPKLVKV RCFCGGVEDV KRCGFKEFSC SKTCSRLLDC KVHCCGEVCH  240
EGECPPCRAK GVYTCQCGKV KEERDCCERV FRCEIECGEM LGCGKHSCSK GCHEGECGKC  300
PLQGRRTCPC GKRVYEGMAC DVVAPSCGGT CDKKLECGIH RCPERCHRGS CVETCRIVVW  360
KSCKCGSLKK QVPCYQDVVC ERKCQRVRDC GRHACKRKCC DTDCPPCSEI CDKKLRCNNH  420
KCPSPCHRGA CAPCPVMVTI SCFCGETRFE VPCGTEKEQK PPKCSKRCQI PPLCRHKSIT  480
RPHKCHYGAC PQCRLACDEE YPCGHKCKLR CHGPIPPPNP EFTLKPKKKK HHHHQSESTP  540
GSPCPPCPEL VWRPCVGEHI GADRMTVCSN KAKFSCDNYC GNLLPCGNHF CTKTCHALKI  600
SGSGEQCEKC SLPCQREREP LCQHPCPLKC HPEECPPCKV LIKRSCHCGA MVHVFECLYY  660
NTLSAKEQIA VRSCSGPCHK KLPNCTHLCP EICHVGQCPS PEKCSKKVIV RCGCQTLKKD  720
WPCHQVQAAY HSSGRDPKDV TKNQFGLGLL ECNSDCKSKI KVDDSELHQR KPKAPEKKEP  780
DVENYVRKRK RRKDKVQEDH QVSSFQKLMG ILRMILLFVI IVVSLIAMAY FGYKGLMWLN  840
DWMNQVETQR QRKRYPRI
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1787792RKRKRR
2787793RKRKRRK
3787794RKRKRRKD
4787795RKRKRRKDK
5788793KRKRRK
6789797RKRKRRKDK
7789794RKRRKD
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G05660.10.0NF-X1 family protein