PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pd.00g665670.m01
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family bHLH
Protein Properties Length: 1048aa    MW: 116676 Da    PI: 4.3762
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pd.00g665670.m01genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH37.73.7e-12351392854
                       HHHHHHHHHHHHHHHHHCTSCC.C...TTS-STCHHHHHHHHHHHHHH CS
               HLH   8 rErrRRdriNsafeeLrellPk.askapskKlsKaeiLekAveYIksL 54 
                       +Er+RR+++N+++ +Lr+l+P+ +      K++Ka+iL  A++Y+k+L
  Pd.00g665670.m01 351 AERKRRKKLNERLYRLRSLVPNiS------KMDKAAILGDAIDYVKDL 392
                       7*********************66......****************98 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1048 aa     Download sequence    
MQNLTERLRP LVGLKGWDYC VIWKLSENQR FIELMDCCCS GADENTQTNG GQELLFPVYP  60
VLPCRDTMQH SRTNSCDVLA HLPSSLSLDS GIYAHPLISN QPIWLNSSSN RDSSAMAERD  120
GTRVLVPSAG GLIELFVSKD VSEDQQVIDY ITAQCNISKE QDTLLGAGCN TSFPVNINDM  180
STEIQPHVFP GNENEGNDNL NSNHFQQPPV VSSSVDHDSN VPYDISVDRI RLCSASPMNF  240
LQHVTYNPEN SMKNGNSNVY YEQRSHESLG GMGLQADADA SNMHNSMHVM EALENMEQQG  300
VEDQDSVKHE AQGGRTADNS GSDCSDQIDD EDDTKYRRRT GKGPQSKNLF AERKRRKKLN  360
ERLYRLRSLV PNISKMDKAA ILGDAIDYVK DLLRQVKELQ DELEQHSNDE GPNTKTSANI  420
CGNHNNFQPE ILNQNGTSIT NEPENDDKPP NGFHVGTAGD IGNISKQQQD SDSTNDRGQQ  480
MEPQVGVTQL DGNELFVTVF CEHKPGGFVR LMEALDTLGL EVTNANVTSF RSLVSNVLKV  540
EKKDSEVVQA DDVRDSLLEI TRNPSSKVWP EMAKAKAKAS ENGSGMNIMM IMPSLIQRLR  600
SLVGLKGWDY CVIWKLSEDQ RFIEWMDCCC SGTEITQYDA GQDLLFPPVL PCRDTMLQHP  660
RTTACDLLAK MPSSLPLDSG FVPIYMETDG TKVLIPIPGG LIELLVTKQV FEDQHVIDFI  720
TAQYSISMEQ DTLDNITGTS IMSEIESKNL LDNGNDQMDI NNVLFQATLS SRICSKLDNL  780
NPPYDVSMGT ISNSPIKFLQ QFTNYNSGNR TKNSIDVSYG DASHEPFLSD KQMDPFICSA  840
ENGFQEMEAM QRSMMGDETQ QHMHMQYMEA LAPDMDQQDG NDKQDSIIHD EGPAADGLAS  900
ILGDAIEYVQ ELQKQAKQLQ DELDDHADDE GPKNSGITGH HNNIQSEIQS ELDPGGPKTD  960
HQHDSISKQS QDSDVIHDHK TQQMEPQVEV AQLDGNQFFV KVFCEHKPGG FVRLMEALSS  1020
LSLEVINANV TSFRCLVSNV FIVEVSFF
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1337358RRRTGKGPQSKNLFAERKRRKK
2352359ERKRRKKL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G16910.11e-111bHLH family protein