PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PCP014109.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family HB-PHD
Protein Properties Length: 704 aa    MW: 78883 Da    pI: 9.5259
Description HB-PHD family protein
Gene Model
Gene Model ID  Type    Source
PCP014109.1    genome  GDR
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  32     2.1e-10  475    519  9          53
                  HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHH CS
     Homeobox   9 keqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRa 53 
                  ++ +e L ++F +n  ps++ +  L+k+lgL+ ++V+ WF+N R+
  PCP014109.1 475 RKAVEKLRQVFSENELPSRAVKDNLSKELGLNPEKVSKWFKNARY 519
                  567899*************************************97 PP

2    PHD       35.8   1.8e-12  233    288  1          51
                  SBTTTSSTCT..TSSEEEBS.SSSSEEETTTSTSSSSHHSHHSS..TBSSHHHHTT CS
          PHD   1 rCkvCgksde..egelvlCd.gCkewfHlkClglkleseekpeg..ewlCeeCkek 51 
                  +C+ C+ +++  +++++lCd +C+ +fH+kCl+++l++e++p+g   w+C+ C++k
  PCP014109.1 233 FCAKCKLNEAfpDNDIILCDgTCNCAFHQKCLDPPLDTENIPRGeqGWFCKFCDCK 288
                  699999955455********56********************9999*******985 PP

Sequence
Protein Sequence    Length: 704 aa
MDTLVKNMRG SGKKSNLKES GKSGYSSSVA KLISSPSFKK GGKVPRVKKL KPKSSKTIIA  60
SALSKKRGAD SSSKASTSRN NDTNKKLMSR KEFHKVHDTD SSKKPSSVKL QDHKLSDNGS  120
DEKGEKRRRK KKPKKDKVEL DETSRLQRRT RYLLIKIKLE QNLIDAYAGE GWKGQSREKI  180
RPELELQRAN KQILNCKLGI RDAIQQLDSL SSVGSIADSF IAPDGSVSHE HIFCAKCKLN  240
EAFPDNDIIL CDGTCNCAFH QKCLDPPLDT ENIPRGEQGW FCKFCDCKME ILELVNAHLG  300
TCFPMNSGWQ DVFKEEATFP VGENSLLNPD EEWPSDDSED DDYKPERNVN SCSISRGGSD  360
DNVSEEELST DDVSVGSDES TDGEIVSGRR QRRSVDYKKL YDEMFGKDGP LLEQISDDED  420
WGPVKRKRRE KESDAASTLM TLYESERNPD IDHTEVKNIR SSDTQVRRSC FRIPRKAVEK  480
LRQVFSENEL PSRAVKDNLS KELGLNPEKV SKWFKNARYL ALKTRKEVSG KDHQAFTPGI  540
SKEPGYENVT GKAADLMASD SDDDTLAETV VHSPKNVNAA FRRKHPKSLS SPLRKSQQKA  600
PSCGSPSKSN KDGKESSDDV SLKKLMNTRT KEKRANLIAG GGGDGCRAAE LEMERLCKAK  660
GRLEHMRQKL LKLQNVKAKQ KSNKSLLHEH TVIYVPVAQL QEKV
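
The domain coordinates listed under Signature Domain can be cross-checked against the sequence block above. The following is a minimal sketch, not part of the PlantTFDB record: the helper names clean_sequence and region are hypothetical, and it assumes the Start/End columns are 1-based and inclusive, as the alignment blocks suggest.

```python
import re

# Sequence block copied from this record, including the spacing and the
# running position counters at the end of each line.
RAW_BLOCK = """
MDTLVKNMRG SGKKSNLKES GKSGYSSSVA KLISSPSFKK GGKVPRVKKL KPKSSKTIIA  60
SALSKKRGAD SSSKASTSRN NDTNKKLMSR KEFHKVHDTD SSKKPSSVKL QDHKLSDNGS  120
DEKGEKRRRK KKPKKDKVEL DETSRLQRRT RYLLIKIKLE QNLIDAYAGE GWKGQSREKI  180
RPELELQRAN KQILNCKLGI RDAIQQLDSL SSVGSIADSF IAPDGSVSHE HIFCAKCKLN  240
EAFPDNDIIL CDGTCNCAFH QKCLDPPLDT ENIPRGEQGW FCKFCDCKME ILELVNAHLG  300
TCFPMNSGWQ DVFKEEATFP VGENSLLNPD EEWPSDDSED DDYKPERNVN SCSISRGGSD  360
DNVSEEELST DDVSVGSDES TDGEIVSGRR QRRSVDYKKL YDEMFGKDGP LLEQISDDED  420
WGPVKRKRRE KESDAASTLM TLYESERNPD IDHTEVKNIR SSDTQVRRSC FRIPRKAVEK  480
LRQVFSENEL PSRAVKDNLS KELGLNPEKV SKWFKNARYL ALKTRKEVSG KDHQAFTPGI  540
SKEPGYENVT GKAADLMASD SDDDTLAETV VHSPKNVNAA FRRKHPKSLS SPLRKSQQKA  600
PSCGSPSKSN KDGKESSDDV SLKKLMNTRT KEKRANLIAG GGGDGCRAAE LEMERLCKAK  660
GRLEHMRQKL LKLQNVKAKQ KSNKSLLHEH TVIYVPVAQL QEKV
"""


def clean_sequence(block: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW_BLOCK)
print(len(seq))               # total residue count after cleaning
print(region(seq, 475, 519))  # Homeobox domain span from the domain table
print(region(seq, 233, 288))  # PHD domain span from the domain table
```

The printed slices can be compared with the PCP014109.1 rows of the two alignment blocks; the same region call can be applied to the Start/End values of any other annotated feature in this record.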
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    125    133  EKRRRKKKP
2    126    132  KRRRKKK
3    127    133  RRRKKKP
4    127    135  RRRKKKPKK
5    128    158  RRKKKPKKDKVELDETSRLQRRTRYLLIKIK
6    128    134  RRRKKKP
7    129    136  RKKKPKKD
8    129    159  RRKKKPKKDKVELDETSRLQRRTRYLLIKIK
9    426    431  RKRREK
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G29940.1  1e-157   HB-PHD family protein