PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID CCG033096.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Salicaceae; Saliceae; Populus
Family bHLH
Protein Properties Length: 602aa    MW: 66202.8 Da    PI: 7.732
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
CCG033096.1genomeLZUView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH36.49.3e-12415461455
                  HHHHHHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHHH CS
          HLH   4 ahnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksLq 55 
                   h ++Er+RR+++ + f  L  ++P +     kK++Ka++Le A++Y+k+Lq
  CCG033096.1 415 DHIMAERKRRKKLSQQFIALSAVVPGL-----KKMDKASVLEGAMKYMKQLQ 461
                  599************************.....9******************9 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF509783.89E-688227IPR017986WD40-repeat-containing domain
Gene3DG3DSA:2.130.10.107.9E-699223IPR015943WD40/YVTN repeat-like-containing domain
SMARTSM003203.9E-81049IPR001680WD40 repeat
CDDcd002009.39E-6313238No hitNo description
PfamPF004006.0E-51346IPR001680WD40 repeat
PROSITE profilePS5029451.70217234IPR017986WD40-repeat-containing domain
PROSITE profilePS5008212.6811758IPR001680WD40 repeat
SMARTSM003202.9E-115998IPR001680WD40 repeat
PfamPF004001.7E-96298IPR001680WD40 repeat
PROSITE profilePS5008218.42966107IPR001680WD40 repeat
PROSITE patternPS0067808599IPR019775WD40 repeat, conserved site
PRINTSPR003202.3E-88599IPR020472G-protein beta WD-40 repeat
SMARTSM003209.0E-13101140IPR001680WD40 repeat
PfamPF004004.2E-9103140IPR001680WD40 repeat
PROSITE profilePS5008219.264108149IPR001680WD40 repeat
PROSITE patternPS006780127141IPR019775WD40 repeat, conserved site
PRINTSPR003202.3E-8127141IPR020472G-protein beta WD-40 repeat
SMARTSM003203.8E-13143182IPR001680WD40 repeat
PfamPF004002.0E-10144182IPR001680WD40 repeat
PROSITE profilePS5008215.454150191IPR001680WD40 repeat
PRINTSPR003202.3E-8169183IPR020472G-protein beta WD-40 repeat
PROSITE patternPS006780169183IPR019775WD40 repeat, conserved site
SMARTSM003201.8185225IPR001680WD40 repeat
PROSITE profilePS5088814.322411460IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
Gene3DG3DSA:4.10.280.101.3E-15415473IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
SuperFamilySSF474593.93E-16416478IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
CDDcd000831.53E-9416457No hitNo description
PfamPF000101.5E-9416461IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
SMARTSM003534.5E-14417466IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0046983Molecular Functionprotein dimerization activity
Sequence ? help Back to Top
Protein Sequence    Length: 602 aa     Download sequence    Send to blast
MPDPIESYKP YTLTQTLQGH KSSISSVKFS SDGRLLGSSS ADKTIKTYSL SPSNPPTSPI  60
TPLHDFHGHE QGVSDLAFSS DSRFIVSASD DKTLRLWDVT TGSTIKTLHG HTNYVFCVSF  120
NPNSSMIVSG SFDETVRIWD VKSGKCLKVL PAHSDPVTCV DFNRDGSLIV SSSYDGLCRI  180
WDSGTGHCIK TLIDDENPPV SFVKFSPNGN YILVGTLDNN LGMDDYKFIH QCHINSLAEF  240
TAQNMATTLL GENLQRSFSS ESFSSKPSLM MTRNTTITST SNGSSSETSQ TSIETPGKQQ  300
RTNSWNSSFS TLHQSPKPTS SFSTPHQSPK PPSPIPESFS FNTSAPPPTA SSQQFYGNLD  360
RLIKPKDEAA SPINMHFQTS ISKAACERSE SYAPEAKQGI KRPYSMTRSA MHVQDHIMAE  420
RKRRKKLSQQ FIALSAVVPG LKKMDKASVL EGAMKYMKQL QEQLKQLQDQ TKTKTMESVV  480
LLKKSKLSVD DECSSSDENF DGLPDSPLPE IEARTTDKDV LIRIHCKNQQ GVGIKILSEI  540
ENLHLSVVNS SVLVFGNSTL DVTVIAQMDN DFSLTMKDLV KKLRLACMKL SCAIIPSSCI  600
QA
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
2gnq_A2e-95222630334WD-repeat protein 5
2xl2_A1e-95222628332WD REPEAT-CONTAINING PROTEIN 5
2xl2_B1e-95222628332WD REPEAT-CONTAINING PROTEIN 5
2xl3_A1e-95222628332WD REPEAT-CONTAINING PROTEIN 5
2xl3_B1e-95222628332WD REPEAT-CONTAINING PROTEIN 5
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1419426ERKRRKKL
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_011021822.10.0PREDICTED: transcription factor bHLH18
RefseqXP_011021823.10.0PREDICTED: transcription factor bHLH18
RefseqXP_011021824.10.0PREDICTED: transcription factor bHLH18
TrEMBLA0A3N7GPD80.0A0A3N7GPD8_POPTR; Uncharacterized protein
STRINGPOPTR_0007s14480.10.0(Populus trichocarpa)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G37850.12e-34bHLH family protein