Parallelize `predict` subcommand by koute · Pull Request #117 · modelfoxdotdev/modelfox

koute · 2022-03-13T10:46:34Z

Predicting from the CLI is painfully slow since it runs only on a single thread, so here's a quick fix.

Caveat: this essentially preloads the whole CSV into memory. It is still a net improvement though since prediction on a single core becomes a problem far earlier (for a CSV which is less than 100MB it's already unusably slow) than consuming too much memory does.

nitsky · 2022-03-13T15:24:26Z

@koute thanks for the PR! We decided to make prediction single threaded for exactly the reason you identified: to avoid buffering the whole CSV. I agree that it is a good idea to provide the option to parallelize predictions, but I think we should put it behind a command line argument, --parallel, that explains the tradeoff. Would you be willing to update this PR with that change?

koute · 2022-03-13T16:36:48Z

Okay, now it doesn't read the whole CSV but will still run in parallel. (:

Speed-wise it's the same; the old version with preloading the CSV:

real	0m11.739s
user	10m29.132s
sys	0m14.350s

the new version without preloading the CSV:

real	0m11.425s
user	9m53.702s
sys	0m51.725s

Is this acceptable or do you still want me to add an argument to disable parallelization?

Parallelize predict subcommand

7448ef7

Do not buffer the whole CSV file when running predict

39b60bb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallelize `predict` subcommand#117

Parallelize `predict` subcommand#117
koute wants to merge 2 commits intomodelfoxdotdev:mainfrom
koute:main

koute commented Mar 13, 2022 •

edited

Loading

Uh oh!

nitsky commented Mar 13, 2022

Uh oh!

koute commented Mar 13, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

koute commented Mar 13, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nitsky commented Mar 13, 2022

Uh oh!

koute commented Mar 13, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

koute commented Mar 13, 2022 •

edited

Loading