[ENH] big data GAM by dswah · Pull Request #188 · dswah/pyGAM

dswah · 2018-07-22T18:17:34Z

write an example like pomegranate out of core:
https://pomegranate.readthedocs.io/en/latest/ooc.html

subsequent PR?

use joblib with Pool? (this will enable use of dask)
use batch_size instead of block_size
enable mini-batches, add batches_per_epoch parameter and partial_fit method

codecov · 2018-07-23T23:21:10Z

Codecov Report

❗ No coverage uploaded for pull request base (master@b986ec5). Click here to learn what that means.
The diff coverage is n/a.

@@            Coverage Diff            @@
##             master     #188   +/-   ##
=========================================
  Coverage          ?   91.33%           
=========================================
  Files             ?       19           
  Lines             ?     2492           
  Branches          ?        0           
=========================================
  Hits              ?     2276           
  Misses            ?      216           
  Partials          ?        0

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b986ec5...03898ea. Read the comment docs.

dswah · 2018-07-24T09:56:40Z

awesome!!!! just tried a dataset that crashes my notebook when no partitioning is used, but that correctly solves when the optimization is incremental!!!!!

maorn · 2018-07-24T10:36:06Z

great 😊, i will convert you for loop into parallel one during the weekend Maor

…

________________________________ From: daniel servén <notifications@github.com> Sent: Tuesday, July 24, 2018 12:56:40 PM To: dswah/pyGAM Cc: Subscribed Subject: Re: [dswah/pyGAM] [WIP] big data GAM (#188) awesome!!!! just tried a dataset that crashes my notebook when no partitioning is used, but that correctly solves when the optimization is incremental!!!!! — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub<#188 (comment)>, or mute the thread<https://github.com/notifications/unsubscribe-auth/ASBgn3ToKBMHlrWCFpfXxI-UZ-UnUuSvks5uJu9YgaJpZM4VaF7t>.

maorn · 2018-07-26T14:15:41Z

hi, i have changed the code now it should work in parallel, i cannot push it into the branch, can you give me access ? Regards, MAor

…

________________________________ From: Maor Nissan Sent: Tuesday, July 24, 2018 1:36:02 PM To: dswah/pyGAM; dswah/pyGAM Cc: Subscribed Subject: Re: [dswah/pyGAM] [WIP] big data GAM (#188) great 😊, i will convert you for loop into parallel one during the weekend Maor

________________________________ From: daniel servén <notifications@github.com> Sent: Tuesday, July 24, 2018 12:56:40 PM To: dswah/pyGAM Cc: Subscribed Subject: Re: [dswah/pyGAM] [WIP] big data GAM (#188) awesome!!!! just tried a dataset that crashes my notebook when no partitioning is used, but that correctly solves when the optimization is incremental!!!!! — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub<#188 (comment)>, or mute the thread<https://github.com/notifications/unsubscribe-auth/ASBgn3ToKBMHlrWCFpfXxI-UZ-UnUuSvks5uJu9YgaJpZM4VaF7t>.

dswah · 2018-07-26T14:53:45Z

@maorn that is really cool!

to contribute your code, please do the following:

put your changes in a safe place
fork the repo, and clone your fork on your computer
commit your changes (ie parallel code into pygam.py)
push your changes to your remote repo fork
open a pull request from your remote repo to this branch

Attention!!
please make sure that you dont lose the code you've already written!

copy it or something before forking/cloning...

looking forward to reading your code :)

maorn · 2018-10-21T08:08:32Z

hi,
what is the state of this branch?
is there anything missing on my hand for committing it to the master branch?

dswah · 2018-10-21T10:46:34Z

hi @maorn!
i think there are still a couple of things we need to do before we merge:

a rebase of your 'parallel' branch off of this one
logic for skipping any parallelism if n_cores==1
logic for partial dependence and quantiles that uses the new features
add some tests for the new features
fix a couple of broken tests

adding parrallel for-loop

mohsenzabihi · 2019-05-17T13:31:08Z

Hi @maorn and @dswah, may I know about the status of this work? do you plan to merge it into master?

dswah · 2019-07-16T15:53:00Z

@mohsenzabihi @ccurro The plan is to merge this branch into master in August.

But it needs a little love right now.
Specifically, i need to

adapt all remaining ocurrences of gam._modelmat like in partial dependence and quantiles to use the new blockwise scheme
remove joblib for now since it doesn't look like we get any benefit from parallelizing linear algebra operations

tjburch · 2022-02-28T14:52:55Z

I know this PR is pretty old, but I'd still be really happy to see this functionality implemented. Figured I'd just mention it since it's been a couple of years since there's been any updates.

WIP QR updating

f2b9c98

dswah changed the title ~~big data GAM~~ [WIP] big data GAM Jul 22, 2018

dswah added 4 commits July 23, 2018 10:11

more qr updating

60ab228

wooo blockwise QR!

130d815

blockwise PIRLS!

5cf0bf4

callbacks working

bea43c2

dswah mentioned this pull request Jul 23, 2018

Memory consumption error #187

Open

dswah added 8 commits July 23, 2018 14:40

pirls is incremental, begin blockwise decorator

d06f4ad

improve docs

4be3b0b

add gamma scaling to constructor

3fb3f63

extend blocks to all models

80ce027

improve blockwise decorator

751baff

all statistics run, also naive_pirls

177b3f7

improve blockwise decorator for args and kwargs

9767cc2

sample method also partitions X

35dc9e1

dswah added 3 commits July 24, 2018 01:21

partial_dep also partitions X

c388b32

formatting

d51cae2

parralelizable version of incremental QR!

7ab929d

maorn and others added 7 commits July 29, 2018 07:44

adding parrallel for loop

90dffdd

add progress bar for out of core progress

73ffe06

reduce memory footprint

ad68c80

reduce memory footprint

39b67be

use np.asanyarray to reduce memory footprint

f156f5c

Merge branch 'master' into bam

1a73a58

Merge branch 'bam' into bam

f602784

dswah added 2 commits October 19, 2018 19:52

lots of bug fixes

6a6d0f5

lots of little fixes

abd2bff

dswah added 7 commits October 21, 2018 14:49

Merge branch 'bam' into bam

530fe3c

reintroduce missing lines....

d13f92d

get rid of reference to None.__ne__

020b96b

Merge pull request #189 from maorn/bam

04fa49b

adding parrallel for-loop

add joblib parallel execution

50d14ae

add joblib to requirements

b986ec5

Merge branch 'master' into bam

03898ea

shyamcody pushed a commit to shyamcody/pyGAM that referenced this pull request Jul 21, 2020

added gamma to constructor following dswah#188 PR.

17d6428

shyamcody mentioned this pull request Jul 21, 2020

gridsearch accepts gamma parameter to exaggerate dof #76

Open

CatarinaPC mentioned this pull request Sep 28, 2021

Bam algorithm implementation #304

Open

dswah added 7 commits November 19, 2025 22:40

fix BAM

0da2e65

remove reffs to joblib

92baffc

typo

5cb10ea

allow shuffled batches

234de92

linting

de4b180

linting

75c6e5f

improve stability

2c2fd9c

dswah changed the title ~~[WIP] big data GAM~~ [ENH] big data GAM Dec 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ENH] big data GAM#188

[ENH] big data GAM#188
dswah wants to merge 39 commits intomainfrom
bam

dswah commented Jul 22, 2018 •

edited

Loading

Uh oh!

codecov bot commented Jul 23, 2018 •

edited

Loading

Uh oh!

dswah commented Jul 24, 2018

Uh oh!

maorn commented Jul 24, 2018 via email

Uh oh!

maorn commented Jul 26, 2018 via email

Uh oh!

dswah commented Jul 26, 2018 •

edited

Loading

Uh oh!

maorn commented Oct 21, 2018

Uh oh!

dswah commented Oct 21, 2018 •

edited

Loading

Uh oh!

mohsenzabihi commented May 17, 2019

Uh oh!

dswah commented Jul 16, 2019 •

edited

Loading

Uh oh!

tjburch commented Feb 28, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

dswah commented Jul 22, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

subsequent PR?

Uh oh!

codecov bot commented Jul 23, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

dswah commented Jul 24, 2018

Uh oh!

maorn commented Jul 24, 2018 via email

Uh oh!

maorn commented Jul 26, 2018 via email

Uh oh!

dswah commented Jul 26, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

maorn commented Oct 21, 2018

Uh oh!

dswah commented Oct 21, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mohsenzabihi commented May 17, 2019

Uh oh!

dswah commented Jul 16, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tjburch commented Feb 28, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

dswah commented Jul 22, 2018 •

edited

Loading

codecov bot commented Jul 23, 2018 •

edited

Loading

dswah commented Jul 26, 2018 •

edited

Loading

dswah commented Oct 21, 2018 •

edited

Loading

dswah commented Jul 16, 2019 •

edited

Loading