Example of file format for NER

Hey! I think it would be useful to have a more detailed explanation of:

1. what the dataset should look like for performing NER, similar to the [fine-tuning example](https://github.com/deepset-ai/FARM/blob/97b0211a37ea7c7d64b4602f0e21b65428b2bd76/test/samples/lm_finetuning/test-sample.txt#L1). The [NER sample](https://github.com/deepset-ai/FARM/blob/97b0211a37ea7c7d64b4602f0e21b65428b2bd76/test/samples/ner/train-sample.txt#L1) is great, and I think it could be improved further with an explanation of what input is expected: the format of the dataset, separating sentences with newlines, etc.

2. what `[start_token, end_token]` mean in `label_list` for NER, plus documentation on why we need `label_list=['[PAD]', 'X', label1, label2, ...]`.

I could also help with the first one, based on what I've learned so far :)
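
To make the request concrete, here is a rough sketch of how I currently understand the input. It assumes a CoNLL-style layout: one token per line with the label in the last column, and a blank line between sentences. The sample text and the helper names `read_conll` and `build_label_list` are made up for illustration and are not part of FARM. The comment about what `[PAD]` and `X` stand for is only my guess, which is exactly the part I'd like to see documented.

```python
from typing import List, Tuple

# A sentence is a list of (token, label) pairs.
Sentence = List[Tuple[str, str]]

# Illustrative data only (assumed layout: token first, label last,
# blank line between sentences).
SAMPLE = """\
EU B-ORG
rejects O
German B-MISC
call O

Peter B-PER
Blackburn I-PER
"""


def read_conll(text: str) -> List[Sentence]:
    """Split CoNLL-style text into sentences of (token, label) pairs."""
    sentences: List[Sentence] = []
    current: Sentence = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            # A blank line closes the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        columns = line.split()
        # First column is the token, last column is the NER label.
        current.append((columns[0], columns[-1]))
    if current:
        sentences.append(current)
    return sentences


def build_label_list(sentences: List[Sentence]) -> List[str]:
    """Collect the labels seen in the data, prefixed with '[PAD]' and 'X'.

    Assumption: '[PAD]' marks padding positions and 'X' marks subword
    pieces that carry no label of their own.
    """
    labels = sorted({label for sentence in sentences for _, label in sentence})
    return ["[PAD]", "X"] + labels


sentences = read_conll(SAMPLE)
label_list = build_label_list(sentences)
```

If that matches what the processor expects, something along these lines next to the NER sample would already answer most of point 1.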

Example of file format for NER #287