Overview
This is a dataset for multi-task learning with a large number (957) of tasks, first used in [1]. It was created by selecting a subset of products from Julian McAuley's Amazon product dataset [2], for which there are at least 300 positive reviews (with scores 4 or 5) and at least 300 negative reviews (with scores 1 or 2). Each of the resulting 957 products is treated as a binary classification task of predicting whether a review is positive or negative.