PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PHT57666.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family AP2
Protein Properties Length: 1190 aa    MW: 132940 Da    pI: 7.2243
Description AP2 family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
PHT57666.1     genome  NCBI    View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    AP2     37.2   7.3e-12  127    174  5          56
         AP2   5 GVrwdkkrgrWvAeIrd.psengkrkrfslgkfgtaeeAakaaiaarkklege 56 
                 G+r+ k +gr+ A Ird      +rk+++lg+f+t eeA +a+ + +++le+e
  PHT57666.1 127 GIRRQK-TGRYGAVIRDtI----RRKQVWLGTFDTVEEASQAYFNKKLELENE 174
                 999877.9*********33....35*************************986 PP

2    AP2     34.8   4e-11    279    326  5          56
         AP2   5 GVrwdkkrgrWvAeIrd.psengkrkrfslgkfgtaeeAakaaiaarkklege 56 
                 GVr+ k +gr+ A Ird      +rk+++lg+f+t eeA  a+   +++le+e
  PHT57666.1 279 GVRRQK-NGRYGAVIRDtI----RRKQVWLGTFDTVEEASLAHFSKKLELENE 326
                 9**888.8*********33....35*********************9999986 PP

3    AP2     29.9   1.4e-09  561    606  5          54
         AP2   5 GVrwdkkrgrWvAeIrdpseng.krkrfslgkfgtaeeAakaaiaarkkle 54 
                 G+r+ k +g++ A I d    + ++k+++lg+f+t eeA +a+   + ++e
  PHT57666.1 561 GIRRQK-TGKYGAVITD----KiRHKQIWLGTFDTVEEASQAYFSKKFEFE 606
                 899777.9*********....4345******************88887776 PP

4    AP2     34.2   6.5e-11  710    757  4          55
         AP2   4 kGVrwdkkrgrWvAeIrdpseng.krkrfslgkfgtaeeAakaaiaarkkleg 55 
                  GVr+ k +gr+ A +rd    + +rk+++lg+f+t eeA +a+   + +le+
  PHT57666.1 710 VGVRRQK-NGRYGAVVRD----KiRRKQVWLGTFDTVEEASQAYFSKKSELEK 757
                 59**888.8*********....5456******************998888876 PP

5    AP2     31.5   4.5e-10  867    913  4          54
         AP2   4 kGVrwdkkrgrWvAeIrdpseng.krkrfslgkfgtaeeAakaaiaarkkle 54 
                 +G+r+ k +gr+ A I d      k+k+++lg+f+t eeA +a+   + +l+
  PHT57666.1 867 RGIRRQK-SGRYGAVITD----RiKHKKVWLGTFDTVEEASQAYLSKKSELK 913
                 9***888.7*********....4446******************88877765 PP

6    AP2     25.5   3.3e-08  976    1013 6          46
         AP2    6 VrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaa 46  
                  V+ +k +g++  eIr p    ++ kr++lg+f taeeA + +
  PHT57666.1  976 VHKRKGSGKYTTEIRNP----ISkKRIWLGTFNTAEEASRVY 1013
                  888999999*******7....246**************9977 PP
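
The six AP2 hits above can be held as plain records for quick checks. The sketch below is illustrative only: the `DomainHit` type and the two helper functions are hypothetical names, not part of PlantTFDB, and the coordinates are copied from the Start and End columns of this record (1-based, inclusive).

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One row of the Signature Domain table (hypothetical container)."""
    number: int
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


# Start/End values as listed for PHT57666.1 in this record.
HITS = [
    DomainHit(1, 127, 174),
    DomainHit(2, 279, 326),
    DomainHit(3, 561, 606),
    DomainHit(4, 710, 757),
    DomainHit(5, 867, 913),
    DomainHit(6, 976, 1013),
]


def span_length(hit):
    """Number of residues covered by a hit, counting both endpoints."""
    return hit.end - hit.start + 1


def hits_are_disjoint(hits):
    """True if no two hits share a residue position."""
    ordered = sorted(hits, key=lambda h: h.start)
    return all(b.start > a.end for a, b in zip(ordered, ordered[1:]))


lengths = [span_length(h) for h in HITS]
print(lengths)
print(hits_are_disjoint(HITS))
```

Because both coordinates are inclusive, each length is `end - start + 1` rather than `end - start`.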

Sequence
Protein Sequence    Length: 1190 aa
METDEQGLLA GENEEETSCK KQKITQIVVD KVKFTIPLEH TNHKAQTTSV ATERSKETDS  60
KMALFGNEWQ NMGTFLGSEE FKGFSSNMHK FQGKETSCIM SNVHGIESFG ECNTSISSGK  120
GKISLIGIRR QKTGRYGAVI RDTIRRKQVW LGTFDTVEEA SQAYFNKKLE LENEKLNQQG  180
NKEDRPEENC DQIQQPESPV VQCLSMANDQ TSDTACVNRI NSHETTHIVE VHKNKMSGEE  240
PGSSKETACG MASVRGTESS VECNTSTSCN SKGEISLIGV RRQKNGRYGA VIRDTIRRKQ  300
VWLGTFDTVE EASLAHFSKK LELENENLNQ QGNKESKSKE NIDQIQQPES PSVQCLSVGN  360
DQIQPPESPS VQCLSVANDP IQQPESPSLQ CLSVADDQIQ QPESPGAQCL SVANDQIQQP  420
ESHRVRCLSV ANDQIQQPES PGVQCLSVGN DQIQPPESSS VQCLSVANNQ TLDTASVSRI  480
NSHITTTHIL GVHKKNWSGK EQEFSKVTSC LMANVQGTES SNECSTTTSC LPTEKRSLLG  540
IRRQKNGRYG AVITDKRSLL GIRRQKTGKY GAVITDKIRH KQIWLGTFDT VEEASQAYFS  600
KKFEFEKLSQ QDNKDNKPKE NLDQIQQPES PVMQCLSMAI DPTLDTAGVN RINPHETTHI  660
VEVHKNNMSG KEPGSSIETP CLMASVHGTE SSAECNTSTS CNPTGKISLV GVRRQKNGRY  720
GAVVRDKIRR KQVWLGTFDT VEEASQAYFS KKSELEKKKL NQQRNKDNRS KKNGDRIQQP  780
GSPVVLASLS VTDVQEFDTA SVGMRNERID FHKTTHIVGV LKSKTAGKEP ESSKETTCLM  840
DNVHDTESYD EGNTTTSRDP TAKRSLRGIR RQKSGRYGAV ITDRIKHKKV WLGTFDTVEE  900
ASQAYLSKKS ELKKLERQSD KEDKPKKNCD QVQQPESHVV ASFPVANHDQ TLNAARVDRR  960
YKRFDPHETE TRYFRVHKRK GSGKYTTEIR NPISKKRIWL GTFNTAEEAS RVYQSNKLEF  1020
QKLVHAKRQC SNEQTCSKQD GKSEKLVNIK QGHENVDSEL ESAGGSKIVV QVSNSSNGGT  1080
EQRIDSHEIG TCKEAFYDYL SNKFDLQISN KVELQSNMPT DSSAREEKQE GQEDDEDLWM  1140
GEWVQLPGNR AVKFSLKLGL PIIDNYGSLL GEFSTLDDLS ICKTEDGNET
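
The sequence above is printed in groups of ten residues, with a running position count at the end of each full line. A minimal sketch for flattening lines in that layout into one string follows; the function name `flatten_sequence_block` is hypothetical, and the example feeds it only the last two lines of this record (positions 1081 to 1190).

```python
def flatten_sequence_block(lines):
    """Join grouped sequence lines into one string, dropping spaces and counts."""
    residues = []
    for line in lines:
        for token in line.split():
            # Running position counts (e.g. "1140") are all digits; skip them.
            if token.isdigit():
                continue
            residues.append(token)
    return "".join(residues)


# Last two lines of the protein sequence shown in this record.
tail = [
    "EQRIDSHEIG TCKEAFYDYL SNKFDLQISN KVELQSNMPT DSSAREEKQE GQEDDEDLWM  1140",
    "GEWVQLPGNR AVKFSLKLGL PIIDNYGSLL GEFSTLDDLS ICKTEDGNET",
]
tail_seq = flatten_sequence_block(tail)
print(len(tail_seq))  # 110
```

Applying the same function to all twenty sequence lines should give a string whose length matches the stated 1190 aa.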
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    726    732  DKIRRKQ
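
The Start and End values in this record are 1-based and inclusive, which is easy to get wrong with 0-based string slicing. The sketch below shows one way to pull the NLS out of the sequence line that covers residues 721 to 780; `slice_1based` is a hypothetical helper, not a PlantTFDB function.

```python
def slice_1based(block, block_start, start, end):
    """Return residues start..end (1-based, inclusive) from a partial block.

    block_start is the 1-based position of block[0] in the full protein.
    """
    if start < block_start or end < start or end - block_start >= len(block):
        raise ValueError("requested range lies outside the block")
    return block[start - block_start : end - block_start + 1]


# Residues 721-780 of PHT57666.1, copied from the Sequence section.
BLOCK_721 = "GAVVRDKIRRKQVWLGTFDTVEEASQAYFSKKSELEKKKLNQQRNKDNRSKKNGDRIQQP"

nls = slice_1based(BLOCK_721, 721, 726, 732)
print(nls)  # DKIRRKQ

# AP2 domain 4 is listed as 710-757, so the NLS range lies inside it.
domain4 = (710, 757)
nls_in_domain4 = domain4[0] <= 726 and 732 <= domain4[1]
print(nls_in_domain4)  # True
```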
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G68550.1  1e-11    ERF family protein