PUbiasedN

PyTorch implementation for experiments in the paper Classification from Positive, Unlabeled and Biased Negative Data.

Requirements

Python >= 3.6
PyTorch >= 0.4.0, scikit-learn, NumPy
yaml to load parameters
nltk, allennlp, h5py to prepare the 20newsgroups ELMO embedding

Usage

The file pu_biased_n.py allows to reproduce most of the experimental results described in the paper:

python(3) pu_biased_n.py --dataset [dataset] --params-path [parameter-path] --random-seed [random-seed]

where dataset is either mnist, cifar10 or newsgroups and parameter-path is a yml file containing the hyperparameters of the experiment. The hyperparameter files used for the results shown in Table 1 can be found under the params/ directory.

20newgroups preprocessing

To prepare the ELMO embedding of the 20newsgroups dataset. Please download the ELMO 5.5B pre-trained model from https://allennlp.org/elmo (elmo_2x4096_512_2048cnn_2xhighway_5.5B_weights) and put it under data/20newsgroups/; then run the two files train_elmo_prepare.py and test_elmo_prepare.py located in this same directory.

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
cifar10		cifar10
data/20newsgroups		data/20newsgroups
mnist		mnist
newsgroups		newsgroups
params		params
.gitignore		.gitignore
README.md		README.md
pu_biased_n.py		pu_biased_n.py
settings.py		settings.py
training.py		training.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PUbiasedN

Requirements

Usage

20newgroups preprocessing

About

Uh oh!

Releases

Packages

Languages

cyber-meow/PUbiasedN

Folders and files

Latest commit

History

Repository files navigation

PUbiasedN

Requirements

Usage

20newgroups preprocessing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages