Today I visualized the accuracy of my XML-Importer and the modeling.

It comes with popups that show the imported data of papers/authors. This way i was able to figure out three problems:
- Lots of Names have special letters, that my xml-inporter can’t handle yet
- I have to transform authors to only use the initials of the pre-names
- Titles in the SLR-csv are all lowercase. My importer however imports the uppercase versions. My error-visualization therefore ignores letter-cases.
Authors that aren’t red are either in the right format in the paper, or it’s a faulty hit (For authors I set the required similarities in the name to low. This however is not the case for the titles, there it’s pretty high, so it ignores like one mal-formated character)
So what’s next?
First I will try to fix the encoding issue. After that, I will change the model-creation, so it only uses the initials of pre-names. Additionally I’ll finally implement a caching system. So TODOs in chronological order:
- Fix encoding
- Fix name-format
- Caching



