Closes #1140 spm load by HenkMutsaerts · Pull Request #1141 · ExploreASL/ExploreASL

HenkMutsaerts · 2022-08-03T15:42:48Z

Linked issue

Closes #1140

How to test

Required: if not defined in the linked issue, add a simple test description here

Comments

Optional: add helpful comments for the reviewers here

jan-petr

Sorry, but this needs a complete overhaul. Can you send the wrong file?! And I'll code it nicely.

External/SPMmodified/spm_load.m

HenkMutsaerts · 2022-08-04T16:32:39Z

Great refactoring, but you now report and fix all empty cells; whereas we only should report and fix the empty cells at the end of lines (so too many delimiters at end of lines). Details:

171 "If there's a single line-end only in the data, then consider everything a single line a divide (if possible) by the number of columns" -> what do you mean? difficult to read

174 rem(numel(d{1}, N) -> this used to be the check if we can reshape by number of columns, so this check is now redundant?

208 Am I correct that you are here filling empty cells with empty strings? That's not what we should do. Empty cells are not the problem (if they were, this should be filled with 'n/a' per BIDS (or NaN)). By the way, difficult to read for me, I need more comments ;)

217 -> this warning is incorrect, it should only warn for empty cells at row ends, not for any empty cells

222 -> this warning is correct! We should add +1 though to compensate for the header.
And the repaired file is the original file, so the line endings still have too many delimiters (or empty cells)

FixIllegalEmptyCells and DetectIllegalEmptyCells are not used anymore?

jan-petr · 2022-08-04T17:00:28Z

Great refactoring, but you now report and fix all empty cells; whereas we only should report and fix the empty cells at the end of lines (so too many delimiters at end of lines). Details:

It fixes too much or too little cells per line. It reports all empty for convenience of user that he knows he has empty cells, but I can disable that.

HENK: Thanks, no it should only report the illegal empty cells at end of lines (222).

208 Am I correct that you are here filling empty cells with empty strings? That's not what we should do. Empty cells are not the problem (if they were, this should be filled with 'n/a' per BIDS (or NaN)). By the way, difficult to read for me, I need more comments ;)

If cells are empty, then they are initialised as empty. I am only changing type of the empty cell from double to char. Because all the read cells are chars.

HENK: Are you sure it won't read any cells as numeric? If all cells are read as char then this is fine of course; otherwise this is just redundant code and we can skip this whole part. We don't care about empty cells that are not at the eol.

222 -> this warning is correct! We should add +1 though to compensate for the header. And the repaired file is the original file, so the line endings still have too many delimiters (or empty cells)

I added +1 to compensate for header.
Repaired file is correct. I has the correct number of delimiters to have the same number of delimiters for each line. That is - having delimiters also for empty cells. It rectifies the number of delimiters as this should correctly be. Instead of removing the delimiters for empty cells which would have been wrong.

FixIllegalEmptyCells and DetectIllegalEmptyCells are not used anymore?

HENK: ???

NEW TEST:

It still mentions all empty cells. It should only mention the empty cells at the end of the line (making a line having more cells than the header has). Did you remove the wrong warning? The warning on line 222 was correct, the warning on line 217 was incorrect.

The code still doesn't remove empty cells at the end of the line... It should remove the empty cells at the end of the line, such that the number of delimiters is the same for each line.

jan-petr · 2022-08-04T18:31:50Z

208 Am I correct that you are here filling empty cells with empty strings? That's not what we should do. Empty cells are not the problem (if they were, this should be filled with 'n/a' per BIDS (or NaN)). By the way, difficult to read for me, I need more comments ;)

If cells are empty, then they are initialised as empty. I am only changing type of the empty cell from double to char. Because all the read cells are chars.

HENK: Are you sure it won't read any cells as numeric? If all cells are read as char then this is fine of course; otherwise this is just redundant code and we can skip this whole part. We don't care about empty cells that are not at the eol.

Yes - numbers are read also as chars.
So this code makes it consistent. Maybe not really necessary, but it helps. Maybe not now, but for future functions, it is good to have clean outputs.

FixIllegalEmptyCells and DetectIllegalEmptyCells are not used anymore?
HENK: ???

Yes - these are deleted - not used anymore.

NEW TEST:

It still mentions all empty cells. It should only mention the empty cells at the end of the line (making a line having more cells than the header has). Did you remove the wrong warning? The warning on line 222 was correct, the warning on line 217 was incorrect.

Sorry. Yes I deactivated the wrong warning. Function was intact though. Just the warnings were switched.

As before. It finds lines with too few or too much delimiters (or empty cells). It doesn't really care if the cell is empty or not. Just checks really the delimiters. So reporting is fixed, but function unchanged.

The code still doesn't remove empty cells at the end of the line... It should remove the empty cells at the end of the line, such that the number of delimiters is the same for each line.

I don't understand. This sentence seems contradictory. The number of delimiters is the same for each line - same as in the header. But that means that there can be empty cells at the end of the line.

So the header has 9 cells. And each line should have 8 delimiters = 9 normal cells, or 5 normal cells and 4 empty, or 4 normal and 5 empty etc. And that's what you see in the figure you've sent - 6 filled cells and then 3 empty ones.

I can of course try to write a function that has 6 normal cells on the line and no empty cells. But if I remove empty cells, then there won't be the delimiters and there will only be 5 delimiters per line = that's not what we want. And probably tsv_write doesn't support this anyway.

Please check and say exactly what to do with empty cells/delimiters at the end of the line. Previous comments were unclear.

Still - all that was in the code before. Only the warnings and comments in code changed....

HENK: Yes you are right, now it works!

jan-petr · 2022-08-15T12:25:44Z

The function textscan treats a last empty cell (so a delimiter follow by line-end) as a non-existing cell (not empty, but completely skipping it). So "text,text,text EOL" is three cells (comma being a delimiter). "text,,,text EOL" is four cells. But "text, EOL" one cell. Though "text,,EOL" two cells.

So I've inserted an extra fix that adds an extra cell at the end of the line if it ends with a delimiter.

Now it all works.

jan-petr

OK.

HenkMutsaerts · 2022-08-15T17:22:59Z

Funnily, I think that none of the changes were necessary for the participants.tsv that I now worked with, except for your very last commit ...

HenkMutsaerts requested review from jan-petr and maartenhammer August 3, 2022 15:42

HenkMutsaerts linked an issue Aug 3, 2022 that may be closed by this pull request

spm_load #1140

Closed

8 tasks

HenkMutsaerts assigned maartenhammer Aug 3, 2022

jan-petr requested changes Aug 3, 2022

View reviewed changes

jan-petr self-requested a review August 4, 2022 11:57

jan-petr assigned jan-petr and unassigned maartenhammer Aug 4, 2022

HenkMutsaerts assigned maartenhammer and unassigned jan-petr Aug 5, 2022

jan-petr removed the request for review from maartenhammer August 15, 2022 07:47

jan-petr assigned jan-petr and unassigned maartenhammer Aug 15, 2022

jan-petr approved these changes Aug 15, 2022

View reviewed changes

HenkMutsaerts and others added 10 commits August 15, 2022 20:17

#1140 spm_load: Automatic fixing empty CSV/TSV cells

b89daa1

#1140 README_SPM.txt: Add spm_load edit

a37c572

#1140 spm_load: Count cells per line

c4e9a1b

#1140 SPM_LOAD: Fix line length issues

085fa54

#1140 SPM_LOAD: Remove unused code

48d13cd

#1140 SPM_LOAD: Minor fix

e619ebd

#1140 SPM_LOAD: Minor fix

2636a5e

#1140 xASL_tsvWrite: avoid last empty line

9a0fa2f

#1140 spm_load: cosmetics

22b094c

#1140 spm_load minor fix

52171bf

jan-petr force-pushed the feature-#1140_spm_load branch from 71eded8 to 52171bf Compare August 15, 2022 18:17

jan-petr merged commit 52171bf into develop Aug 15, 2022

jan-petr deleted the feature-#1140_spm_load branch August 15, 2022 18:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Closes #1140 spm load#1141

Closes #1140 spm load#1141
jan-petr merged 10 commits intodevelopfrom
feature-#1140_spm_load

HenkMutsaerts commented Aug 3, 2022

Uh oh!

jan-petr left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

HenkMutsaerts commented Aug 4, 2022

Uh oh!

jan-petr commented Aug 4, 2022 •

edited by HenkMutsaerts

Loading

Uh oh!

jan-petr commented Aug 4, 2022 •

edited by HenkMutsaerts

Loading

Uh oh!

jan-petr commented Aug 15, 2022

Uh oh!

jan-petr left a comment

Uh oh!

HenkMutsaerts commented Aug 15, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

HenkMutsaerts commented Aug 3, 2022

Linked issue

How to test

Comments

Uh oh!

jan-petr left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

HenkMutsaerts commented Aug 4, 2022

Uh oh!

jan-petr commented Aug 4, 2022 • edited by HenkMutsaerts Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jan-petr commented Aug 4, 2022 • edited by HenkMutsaerts Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jan-petr commented Aug 15, 2022

Uh oh!

jan-petr left a comment

Choose a reason for hiding this comment

Uh oh!

HenkMutsaerts commented Aug 15, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jan-petr commented Aug 4, 2022 •

edited by HenkMutsaerts

Loading

jan-petr commented Aug 4, 2022 •

edited by HenkMutsaerts

Loading