fixed multiprocessing issue during RoBERTa prediction - Solution to #1558 #1559

etqadkhan · 2023-11-24T16:59:32Z

With reference to #1558
When using a RoBERTa model for doing prediction, when the model is loaded using ClassificationModel() it prompts a warning stating,

UserWarning: use_multiprocessing automatically disabled as xlmroberta fails when using multiprocessing for feature conversion.

When prediction is performed on text data, if the number of records are a little more than a handful, the prediction progress bar takes infinite execution.

The issue occurs because in the classification_model.py file under the ClassificationModel class, there are two arguments that concern with multiprocessing, them being, args.use_multiprocessing and args.use_multiprocessing_for_evaluation. While one is set to False by default, the other remains to be True,as can be seen in the screenshot attached:

To be able to perform prediction by successfully disabling multiprocessing, we need to disable args.use_multiprocessing_for_evaluation = False and it should work fine. The approach has been tested locally and has proven to be working. Screenshot attached:

wesngoh · 2024-10-26T08:03:48Z

Faced a similar issue and am glad to have found your workaround, im surprised this have not been pushed after a year.

fixed multiprocessing issue during RoBERTa prediction

7b69dcc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fixed multiprocessing issue during RoBERTa prediction - Solution to #1558 #1559

fixed multiprocessing issue during RoBERTa prediction - Solution to #1558 #1559

etqadkhan commented Nov 24, 2023

wesngoh commented Oct 26, 2024

fixed multiprocessing issue during RoBERTa prediction - Solution to #1558 #1559

Are you sure you want to change the base?

fixed multiprocessing issue during RoBERTa prediction - Solution to #1558 #1559

Conversation

etqadkhan commented Nov 24, 2023

wesngoh commented Oct 26, 2024