PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PCP017818.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family bHLH
Protein Properties Length: 2644aa    MW: 292849 Da    PI: 5.6788
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PCP017818.1genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH33.29.5e-11607652555
                  HHHHHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHHH CS
          HLH   5 hnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksLq 55 
                  h+++Er RR++i +++ +L+el+Pk+      + +Ka++L + +eY++ Lq
  PCP017818.1 607 HSIAERLRREKISDRMKNLQELVPKS-----NRTDKASMLDEIIEYVRFLQ 652
                  99***********************7.....59*****************9 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2644 aa     Download sequence    
MRRRKEAKME NPLSSHSHSR INFIDIEDDT FRFTPTSRNP SINPINVDDE DQDLRSITAS  60
LLKSVPTPAE ADNFIDLSEE TLDFYDGDDE LRILRFKPSK TPFGKRGKGL FSDFSATESG  120
QSSNSKGDPD FVCEICVEPK SGIESFRIEN CSHGYCTECM AKYVASKLQE NITSIRCPVP  180
DCIGLLEPEY CRPILPPEVF DQWGTALCEA VILGSEKFYC PYKDCSAMLI DDGKEVVRQS  240
ACPNCWRMFC AQCKAPWHEG IECEEFLKLN KDEREREDIM LKNLAQKQQW RRCPKCRFYV  300
EKSMGCMFMM CRCRTAFCYR CGDVLENIHN HYCPSCGGVS NGFHPLPVTL SSHTKSEISS  360
SSWSDVDAKE IPSWACCESE QPNEALLGSV VVVDEHENNS SPTYLTYSNS LESLAAQDVS  420
SILQDFEPDA QDGGKNCDVQ NIGMHYGVDG NMVRIPLRSI APENVGCNSF EPLLYQGFNG  480
DFETLSPISQ PWPLQSYEGV SPAPASGMVQ RKMCSFGING DYGDYVDYVM PSDPNNLAAL  540
VTNTFNVYHK KGTQDVQNYP LPSFPSAPRM SGLQSLPQNT STNPVGECNG NGKPRMRARR  600
GQATDPHSIA ERLRREKISD RMKNLQELVP KSNRTDKASM LDEIIEYVRF LQLQLKELSM  660
SKVGAAGAVI PLTTDSQAKV GNGLPQLPSI GQAADVSFDE IALEQLVRLM ESDADKALQH  720
LQSEGFCLVP VALADAISAA KESSSSRPSA PVPDDWKKKE TSFADIGVVQ NNSRSSSNSS  780
SSPDRIKRER RVAVEAERKA LLGLRIELLE FLVISQNLSL QKHLNLGTLA KWGFKFAASN  840
YWFWRSEPPN VSRDFWMPDQ SCRVCYDCDS QFTIFNRRHH CRLCGRVFCA KCTANSIPAA  900
SDEPRAGRED WERIRVCYYC FKQWEQGVVA PNNGAGPAAS PGLSPSPSAT SLASTKSSCT  960
CHSSSSTIGS TPYSTGPYQH VPYSSGRSPS QSSSQIDSVP VQQDNVTSQT SISSDVAMAE  1020
PSLNQYGFCV NRSDDEDDDY GVYRLDSEPS HLSHGNDYYG AVTIEEFASV YGPQNVHLDG  1080
DNTSSLLPGS FDTQDAVGIH KIEEEPYEHD NGDQCGTSPY DLQSTNTEPV DFENNGLLWL  1140
PPEPEDEEDE REAVLFDDDD YDGGGSGGGA GEWGYLGSSN SFGNGECRTR EKSIEEHRKA  1200
MKNVVEGHFR ALVSQLLQVE NLPLGDEGNN ESWLDIITSL SWEAATLLKP DTSKGGGMDP  1260
GGYVKVKCIA CGRRTDSTVV KGVVCKKNVA HRRMTSKIEK PRFLILGGAL EYQRVSNLLS  1320
SFDTLLQQEM DHLKMAVAKI DSHHPNVLLV EKSVSRYAQD YLLAKDISLV LNIKRPLLER  1380
IARCTGAQIV PSIDHLTSPK LGFCDMFHVE KFLEVHGSAG QGGKKLTKTL MFFEGCPKPL  1440
GVTVLLYGAN GDELKKVKHV VQYGVFAAYH LALETSFLAD EGASLPELTL KSEITVALPD  1500
KPSSIDRSIS TIPGFSVPPA GKPQGPDASR ELQKSNQGLI SDNNSSTTSG PILNMQGADS  1560
ICSSKACSQA FLIEHALSSR ESRSPFTSLS PPEEDITECY RKELPSICAS ENKIDAGSKD  1620
SCLDNPAQAG EALLNSSLIS NSLATSESLG HGGGALAANH GETPELTSIK HHSDYQNEEV  1680
GSSKEEFPPS PSDHQSILVS LSTRCVWKGT VCERSHLFRI KYYGSFDKPL GRFLRDHLFD  1740
QNYLCRSCGM PSEAHVHCYT HRQGSLTISV KKLPEILLPG EREGKIWMWH RCLKCPRANG  1800
FPPATRRVVM SDAAWGLSFG KFLELSFSNH AAANRVATCG HSLHRDCLRF YGFGRMVACF  1860
RYASIHVHSV YLPPQKLEFN YDNQEWIQKE VEEVGHRAEL LFTELHNALN QILETRPISG  1920
TPDGGKKAPE SSHQIVELEE MLQKEREDFE ESLQKAMHRE VKCGQPAVDI LEINRLRRQL  1980
LFHSYIWDQR LIQAASLSKN SFQEGLRSSL PKLKEKPISS MEKLVETNIN SKAGKGFSSC  2040
DSSLRETKPD VSIYQGGDVG GFSQPEGEQK NNEIVQNPNH SNEAKISTRS SENAMDKSDP  2100
LESGLSVRRA LSEGNESLVV ANLSDTLDAA WTGESHPTSM IPKENGYSKP DSTLVNSPTV  2160
MRKVASNSDL QNCAVDQAGV QTTASTHSLS STSSLKVFDK SYSLNAQKIN IGEYNPVNVP  2220
MFRESERQSG ARLLLPIGIN DTVIPVFDDE PTSVIAYALV SPDYHVQISE SERPRDAMDG  2280
SVSVPLFDSA NLLSLSSFDE SFSETYRNIG SSDESMSSVS RSRSSQALDS LLSKDIHARV  2340
SFTDDGPLGK VKYTVTCYYA TRFEALRRTC CPSERDFVRS LSRCKKWGAQ GGKSNVFFAK  2400
TLDDRFIIKQ VTKTELESFI KFAPSYFKYL SESISTRSPT CLAKILGIYQ VSSKHGKAGK  2460
ESKMDVLVME NLLFRRNVTR LYDLKGSSRS RYNPDTSGSN KVLLDQNLIE AMPTSPIFVG  2520
SKAKRRLERA VWNDTAFLAS IDVMDYSLLV GVDEEKDELV VGIIDFMRQY TWDKHLETWV  2580
KTSGILGGPK NTSPTVISPQ QYKKRFRKAM TTYFLMVPDQ WSPLMIVPSG SQSDLGEETA  2640
QDPS
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G58010.12e-45bHLH family protein