Rating prediction could stream test-file to save memory #432

Open
@jkleint

I'm making predictions with a command like this:

rating_prediction \
    --recommender="$method" \
    --rating-type=byte \
    --training-file="$1" \
    --test-file="$2" \
    --test-no-ratings \
    --prediction-file="$3"

Memory use seems to grow with the size of the test file: with a large `--test-file` it uses a lot of memory, and with a small one it uses very little. I'm not familiar with the internals of MyMediaLite, but can't each prediction be made independently, which would allow a streaming read of the test file? That would be faster and would make larger data sets usable.
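To make the idea concrete, here is a minimal Python sketch of what a streaming read could look like. It is not MyMediaLite code: `stream_predictions` and the `predict` callback are hypothetical names, and the sketch assumes each test line holds whitespace-separated user and item IDs, with any further columns ignored. Only one line is held in memory at a time, so memory use would not depend on the size of the test file.

```python
from typing import Callable, Iterable, Iterator


def stream_predictions(
    test_lines: Iterable[str],
    predict: Callable[[str, str], float],
) -> Iterator[str]:
    """Yield one tab-separated 'user, item, prediction' row per test line.

    `predict` is a hypothetical stand-in for a trained recommender; each
    call is independent, so no test data has to be kept around.
    """
    for line in test_lines:
        fields = line.split()
        if len(fields) < 2:
            # Skip blank or malformed lines instead of failing.
            continue
        user, item = fields[0], fields[1]
        yield f"{user}\t{item}\t{predict(user, item)}"
```

Because an open file object is itself an iterable of lines, it can be passed directly as `test_lines`, and each yielded row can be written to the prediction file as soon as it is produced.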

It would also be awesome if I could read from a process or fifo (i.e., streaming read without seeking) so I could store my data compressed and uncompress it on the fly.
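As an illustration of the streaming-without-seeking case, the following Python sketch decompresses gzip data from a stream that is only read forward, such as a pipe or fifo. Again, this is only a sketch of the requested behavior, not something MyMediaLite is known to support; `iter_gzip_lines` is a hypothetical helper name.

```python
import gzip
from typing import BinaryIO, Iterator


def iter_gzip_lines(stream: BinaryIO) -> Iterator[str]:
    """Yield decoded lines from a gzip-compressed binary stream.

    The stream is read sequentially and decompressed on the fly, so it
    works on inputs that cannot seek (pipes, fifos, process output).
    """
    with gzip.open(stream, mode="rt", encoding="utf-8") as text:
        for line in text:
            yield line.rstrip("\n")
```

Passing `sys.stdin.buffer` as `stream` would let a script read a compressed test file from a shell pipe or redirect, and the resulting lines could be consumed one at a time without ever holding the uncompressed file in memory.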
