
New tutorial for 'pdal tindex create' and readers.tindex #4550

Closed
ibell13 wants to merge 4 commits into PDAL:master from ibell13:tindex-tutorial

ibell13 (Contributor) commented Nov 14, 2024

Made a new tutorial for some PDAL tile index workflows:

  • creating a tile index from an S3-hosted, tiled COPC dataset using pdal tindex create
  • using readers.tindex to read the index where it intersects certain polygons, and writing a HeightAboveGround raster for each.

When #4548 gets implemented, it could make some of the garbage I do to feed WKT into readers.tindex unnecessary; but right now I think that's the simplest way to do it.

Suggestions for clarification or improvement are welcome!

commit 3a91939
Author: ibell13 <[email protected]>
Date:   Thu Nov 14 14:09:04 2024 -0600

    add link in tutorial

commit 3450dda
Author: ibell13 <[email protected]>
Date:   Wed Nov 13 12:43:23 2024 -0600

    add parquet format

commit cc845ac
Author: ibell13 <[email protected]>
Date:   Tue Nov 12 14:55:41 2024 -0600

    tutorial - add images

commit 5c2e4c6
Author: ibell13 <[email protected]>
Date:   Tue Nov 12 14:27:09 2024 -0600

    tindex tutorial - draft 2

commit fe33e2d
Author: ibell13 <[email protected]>
Date:   Thu Nov 7 23:26:49 2024 -0600

    initial powershell additions

commit ffbc849
Author: ibell13 <[email protected]>
Date:   Mon Nov 4 16:54:00 2024 -0600

    _toc.yml: link tutorial

commit 3e81991
Author: ibell13 <[email protected]>
Date:   Mon Nov 4 16:40:35 2024 -0600

    new TIndex tutorial - draft
implementation of the Apache Parquet format.

`tindex create` can take input from stdin (the -s flag) or a glob pattern
(the --filespec option). Filespec can read files from a local directory, using
Member


put CLI arguments in <pre> sections.

pdal tindex create WI_Adams.gpkg -s < files.txt
```

For each of these files, an internal filter (a version of {ref}`filters.hexbin`)
Member


why does the user care about an "internal filter"?

$ pdal tindex create -s --tindex "WI_Adams_2019.parquet" -f Parquet \
--readers.copc.threads=10 --readers.copc.resolution=10 \
--lyr_name "WI_Adams_2019" --threads=8 --edge_length=20 \
< files.txt
Member


Instead of files.txt, maybe have the user fetch the URLs from GitHub directly via curl or something similar. The idea is to give them something they can copy, paste, and have working.

--lyr_name "WI_Adams_2019" --threads=8 --edge_length=20 \
< files.txt
```
```{note}
Member


Update this note to say that, in our scenario, most of these threads are waiting on network I/O, so it is possible to use quite a few of them. If you were running on local data, a large number of threads might cause contention, because most of them would be doing work instead of waiting on I/O.
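
A rough illustration of the point about I/O-bound work, offered only as a sketch: here `sleep` stands in for a network wait, and `xargs -P` worker processes stand in for threads. Both are assumptions made for illustration; this is not how `pdal tindex create` is implemented, and `run_waits` is a hypothetical helper name.

```shell
#!/bin/sh
# Each input line is a simulated "network wait" in seconds.
# sleep uses almost no CPU, so the waits can overlap freely.
# run_waits WORKERS prints the elapsed wall-clock seconds
# for four 1-second waits spread across WORKERS processes.
run_waits() {
    start=$(date +%s)
    printf '1\n1\n1\n1\n' | xargs -n 1 -P "$1" sleep
    end=$(date +%s)
    echo $((end - start))
}

serial=$(run_waits 1)      # one worker: the waits run back to back
parallel=$(run_waits 4)    # four workers: the waits overlap

echo "1 worker:  ${serial}s"
echo "4 workers: ${parallel}s"
```

If the tasks were CPU-bound rather than idle waits, adding workers beyond the available cores would not be expected to shorten the run in the same way, which is the contention case described for local data.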

hobu added this to the 2.8.3 milestone Dec 12, 2024
hobu mentioned this pull request Dec 23, 2024
hobu (Member) commented Jan 10, 2025

picked up in #4599

hobu closed this Jan 10, 2025