Pathology Masked Inference WSI Dataset#1869
bhashemian merged 31 commits into Project-MONAI:master
Signed-off-by: Behrooz <[email protected]>
Nic-Ma
left a comment
Put minor comments.
Others look good to me.
Thanks.
wyli
left a comment
thanks! I put some comments inline, mainly asking for some docstring/variable name updates
monai/apps/pathology/datasets.py
    class MaskedInferenceWSIDataset(Dataset):
        """
        This dataset load the provided tissue masks at an arbitrary resolution level,
Just for clarification, could you revise the terminology in this file? There are data, sample, patch, region, and sub_region. What's the difference between mask_location and image_location?
sample is one of the data. We first extract region from a WSI, then extract multiple patches from that region based on the grid_shape. I'll remove sub_region.
The mask and the image are at different resolution levels, so their locations, although related, are not the same.
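To make the two ideas in this reply concrete, here is a minimal sketch, not the code in this PR. It assumes the mask is a downsampled view of the level-0 image by a single integer factor, and that a region is split evenly into a grid of patches. The function names and the `mask_level_ratio` parameter are hypothetical and chosen only for illustration.

```python
import numpy as np


def mask_to_image_location(mask_location, mask_level_ratio):
    """Map a (row, col) mask pixel to the level-0 (row, col) of that pixel's center.

    Assumption: one mask pixel covers mask_level_ratio x mask_level_ratio image pixels.
    """
    row, col = mask_location
    return (
        int((row + 0.5) * mask_level_ratio),
        int((col + 0.5) * mask_level_ratio),
    )


def split_region_into_patches(region, grid_shape):
    """Split an (H, W, C) region into grid_shape[0] * grid_shape[1] equal patches."""
    rows, cols = grid_shape
    height, width = region.shape[:2]
    patch_h, patch_w = height // rows, width // cols
    patches = [
        region[r * patch_h:(r + 1) * patch_h, c * patch_w:(c + 1) * patch_w]
        for r in range(rows)
        for c in range(cols)
    ]
    return np.stack(patches)


# One mask pixel at (3, 5), with the mask 64x smaller than the level-0 image.
image_location = mask_to_image_location((3, 5), mask_level_ratio=64)

# A dummy 512x512 RGB region split into a 2x2 grid of 256x256 patches.
region = np.zeros((512, 512, 3), dtype=np.uint8)
patches = split_region_into_patches(region, grid_shape=(2, 2))
```

The same physical point therefore has different coordinates in the mask and in the image, which is why the two locations are related but not equal.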
So, what are the differences between data, dataset, and WSI? Could you please document these, so that it'll be easier for the other developers to maintain?
@wyli, I modified the naming a bit. Now we have self.data like other Datasets, which is a collection of samples, and we have the hierarchy of data > sample > patch.
WSI stands for whole slide image, and we don't have a variable specifically called WSI, do we?!
It is only represented in the class names and in variables like wsi_object_dict, which is a dictionary of either OpenSlide or CuCIM objects.
And we don't have a dataset variable here, but in general I refer to an instance of Dataset as dataset.
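The data > sample > patch hierarchy described above can be sketched as follows. This is an illustrative toy class, not the implementation in this PR: `ToyGridDataset`, the sample keys `name` and `location`, and the rule for computing patch locations are all assumptions made for the example.

```python
class ToyGridDataset:
    """Toy illustration of the hierarchy: data holds samples, a sample yields patches."""

    def __init__(self, data, patch_size, grid_shape=(1, 1)):
        self.data = list(data)  # data: a sequence of samples
        self.patch_size = patch_size
        self.grid_shape = grid_shape

    def __len__(self):
        return len(self.data)

    def __getitem__(self, index):
        sample = self.data[index]  # sample: one element of data
        top, left = sample["location"]
        # patch: one grid cell of the region anchored at the sample's location
        return [
            {
                "name": sample["name"],
                "location": (top + r * self.patch_size, left + c * self.patch_size),
            }
            for r in range(self.grid_shape[0])
            for c in range(self.grid_shape[1])
        ]


dataset = ToyGridDataset(
    data=[
        {"name": "slide_a", "location": (0, 0)},
        {"name": "slide_a", "location": (512, 0)},
    ],
    patch_size=256,
    grid_shape=(2, 2),
)
patches = dataset[1]  # four patch descriptors for the second sample
```

Here `dataset` is the instance, `dataset.data` is the collection, each dictionary in it is a sample, and indexing returns that sample's patches.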
please update the docstring. it's also about explaining this in the code, to help the other developers. "sample is one of the data. We first extract region from a WSI" is already confusing...
In MONAI's Dataset, data is a Sequence of something. What do you call each element? Here I called it sample.
Extracting a region from a whole slide image is very common in digital pathology. Which part of it should I explain more?
could you put a paper reference here? if that could help clarify the logic. Perhaps we could have comments from Ziyue?
@wyli, could you please check if all your comments are addressed? Have I missed anything?
wyli
left a comment
Thanks! Sorry, I could have suggested this earlier: with code plus a paper reference, everything becomes self-contained.
@wyli, no problem. Could you please let me know which part of the code you think a paper reference would be appropriate for?
The concept of sampling data into region/grid/patch representations is used throughout this module. It would be nice if we could cite any of the existing papers that use this topology.
That makes sense! I'll find a paper and add it.
Description
This PR implements a dataset for pathology inference. It uses tissue foreground masks to extract patches from whole slide images and provides the metadata needed for post-processing of the inference results.
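As a rough illustration of that flow, and not the dataset's actual code, the sketch below lists the foreground pixels of a small binary mask and writes one stand-in prediction per pixel back into a map at mask resolution. The mask values, the placeholder probabilities, and the variable names are all assumptions made for this example.

```python
import numpy as np

# Hypothetical 3x3 tissue mask: 1 marks foreground (tissue), 0 marks background.
mask = np.array(
    [
        [0, 1, 0],
        [0, 1, 1],
        [0, 0, 0],
    ],
    dtype=np.uint8,
)

# Every foreground pixel becomes one inference location, in row-major order.
mask_locations = np.argwhere(mask > 0)

# Stand-in for model output: one probability per location (placeholder values).
probabilities = np.linspace(0.1, 0.9, num=len(mask_locations))

# Post-processing: use each location as metadata to place its result in a map.
prob_map = np.zeros(mask.shape, dtype=np.float32)
for (row, col), prob in zip(mask_locations, probabilities):
    prob_map[row, col] = prob
```

Keeping the mask location alongside each patch is what lets the per-patch results be assembled into a single slide-level map afterwards.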
Status
Ready
Types of changes
./runtests.sh -f -u --net --coverage
./runtests.sh --quick --unittests
make html command in the docs/ folder.