PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PHU28073.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family AP2
Protein Properties Length: 1190aa    MW: 132966 Da    PI: 7.1332
Description AP2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PHU28073.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1AP237.27.3e-12127174556
         AP2   5 GVrwdkkrgrWvAeIrd.psengkrkrfslgkfgtaeeAakaaiaarkklege 56 
                 G+r+ k +gr+ A Ird      +rk+++lg+f+t eeA +a+ + +++le+e
  PHU28073.1 127 GIRRQK-TGRYGAVIRDtI----RRKQVWLGTFDTVEEASQAYFNKKLELENE 174
                 999877.9*********33....35*************************986 PP

2AP236.51.2e-11279326556
         AP2   5 GVrwdkkrgrWvAeIrd.psengkrkrfslgkfgtaeeAakaaiaarkklege 56 
                 GVr+ k +gr+ A Ird      +rk+++lg+f+t eeA  a+   +++le+e
  PHU28073.1 279 GVRRQK-NGRYGAVIRDtI----RRKQVWLGTFDTVEEASLAYFSKKLELENE 326
                 9**888.8*********33....35************************9986 PP

3AP229.91.4e-09561606554
         AP2   5 GVrwdkkrgrWvAeIrdpseng.krkrfslgkfgtaeeAakaaiaarkkle 54 
                 G+r+ k +g++ A I d    + ++k+++lg+f+t eeA +a+   + ++e
  PHU28073.1 561 GIRRQK-TGKYGAVITD----KiRHKQIWLGTFDTVEEASQAYFSKKFEFE 606
                 899777.9*********....4345******************88887776 PP

4AP234.16.8e-11710757455
         AP2   4 kGVrwdkkrgrWvAeIrdpseng.krkrfslgkfgtaeeAakaaiaarkkleg 55 
                  GVr+ k +gr+ A +rd    + +rk+++lg+f+t eeA +a+   + +le+
  PHU28073.1 710 VGVRRQK-NGRYGAVVRD----KiRRKQIWLGTFDTVEEASQAYFSKKSELEK 757
                 59**888.8*********....5456******************998888876 PP

5AP231.54.5e-10867913454
         AP2   4 kGVrwdkkrgrWvAeIrdpseng.krkrfslgkfgtaeeAakaaiaarkkle 54 
                 +G+r+ k +gr+ A I d      k+k+++lg+f+t eeA +a+   + +l+
  PHU28073.1 867 RGIRRQK-SGRYGAVITD----RiKHKKVWLGTFDTVEEASQAYLSKKSELK 913
                 9***888.7*********....4446******************88877765 PP

6AP225.53.3e-089761013646
         AP2    6 VrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaa 46  
                  V+ +k +g++  eIr p    ++ kr++lg+f taeeA + +
  PHU28073.1  976 VHKRKGSGKYTTEIRNP----ISkKRIWLGTFNTAEEASRVY 1013
                  888999999*******7....246**************9977 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1190 aa     Download sequence    
METDEQGLLA DENEEETSCK KQKITQIVVD KVKFTIPLEH TNHKAQTTSV ATERSKETDS  60
KMALFGNEWQ NMGTFLGSEE FKGFSSNMHK FQGKETSCIM SNVHGIESFG ECNTSISSGK  120
GKISLIGIRR QKTGRYGAVI RDTIRRKQVW LGTFDTVEEA SQAYFNKKLE LENEKLNQQG  180
NKEDRPEENC DQIQQPESPV VQCLSMANDQ TSDTACVNRI NSHETTRIVE VHKNKMSGEE  240
PGSSKETACG MASVRGTESS VECNTSTSCN SKGEISLIGV RRQKNGRYGA VIRDTIRRKQ  300
VWLGTFDTVE EASLAYFSKK LELENENLNQ QGNKESKSKE NIDQIQQPES PSVQCLSVGN  360
DQIQPPESPS VQCLSVANDP IQQPESPSLQ CLSVADDQIQ QPESPGAQCL SVANDQIQQP  420
ESHRVRCLSV ANDQIQQPES PGMQCLSVGN DQIQPPESSS VRCLSVANNQ TLDTASVSRI  480
NSHITTTHTL GVHKKNWSGK EQEFSKVTSC LMANVQGTES SNECSTTTSC LPTEKRSLLG  540
IRRQKNGRYG AVITDKRSLL GIRRQKTGKY GAVITDKIRH KQIWLGTFDT VEEASQAYFS  600
KKFEFEKLSQ QDNKDNKPKE NLDQIQQPES SVMQCLSMAI DPTLDTAGVN RINPHETTHI  660
VEVHKNNMSG KEPGSSKETP CLMASVHGTE SSAECNTSTS CNPTGKISLV GVRRQKNGRY  720
GAVVRDKIRR KQIWLGTFDT VEEASQAYFS KKSELEKKKL NQQRNKDNRS KKNGDRIQQP  780
GSPVVLASLS VTDVQAFDTA SVGMRNERID FHKTTHIVGV HKSKTAGKEP ESSKETSCLM  840
DNVHDTESYD EGNTTTSRDP TAKRSLRGIR RQKSGRYGAV ITDRIKHKKV WLGTFDTVEE  900
ASQAYLSKKS ELKKLERQSD KEDKPKKNCD QVQQPESHVV ASFPVANHDQ TLNAARVDRR  960
YKGFDPHETE TRYFRVHKRK GSGKYTTEIR NPISKKRIWL GTFNTAEEAS RVYQSNKLEF  1020
QKLVHAKRQC SNEQTFSKQD GKSEKLVNIK QGHENVDSEL ESAGGSEIVV QVSNSSNGGT  1080
EQRIDSHEIG TCEEAFYGYL SNKFDLQISN KVELQSNMPT DSSAREEKQE GQEDDEDLWM  1140
GEWVQLPGNR AVKFSQKLGL PIIDNYGSLL GEFSTLDDLS ICKTEDGNET
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1726732DKIRRKQ
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G68550.12e-12ERF family protein