Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Trying to Test #2

Open
darkfrog26 opened this issue Jan 15, 2018 · 1 comment
Open

Trying to Test #2

darkfrog26 opened this issue Jan 15, 2018 · 1 comment

Comments

@darkfrog26
Copy link

I've created a fork of this project to upgrade to the latest version of Spark and make a few other changes. However, I'm having problems understanding the simple use-case of working with this in order to create unit tests:

https://github.com/darkfrog26/pu4spark/blob/master/src/test/scala/specs/SimpleUsageSpec.scala#L24

Any feedback on proper operation of would be appreciated. I'm still a bit new to Spark ML, but your README gives very little clarity on proper operation.

@astrakhantsev
Copy link
Collaborator

val weightedDF = puLearner.weight(training, "label", "features")
// TODO: what's next?

Next you can use 'outputLabel' column of weightedDF dataframe - it would contain or each instance the number from 0 to 1 reflecting classifier's confidence for that instance, i.e. how likely this instance is positive or negative.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants