
Add Left-Right Planarity Test Algorithm to NetworKit #1276

Merged
fabratu merged 45 commits into networkit:master from Schwarf:feature/left_right_planarity_test
Feb 4, 2025

Conversation

@Schwarf
Contributor

@Schwarf Schwarf commented Jan 8, 2025

Overview

This PR introduces an implementation of the Left-Right Planarity Test algorithm, which is used to determine whether a given graph is planar. The implementation was guided by the paper "The Left-Right Planarity Test" as well as the corresponding implementation in NetworkX.
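As background for the test described above, a cheap necessary condition for planarity follows from Euler's formula: a simple connected planar graph on v ≥ 3 vertices has at most 3v − 6 edges. The sketch below (not part of this PR's code) shows the filter; note that passing the bound proves nothing, which is exactly why an exact algorithm such as the left-right test is needed.

```python
# Necessary-condition filter for planarity via Euler's formula:
# a simple connected planar graph with v >= 3 vertices has at most
# 3*v - 6 edges. Failing the bound proves non-planarity; passing it
# is inconclusive.

def may_be_planar(num_nodes: int, edges: list[tuple[int, int]]) -> bool:
    """Return False only if the edge count already rules out planarity."""
    if num_nodes < 3:
        return True
    return len(edges) <= 3 * num_nodes - 6

# K5 (complete graph on 5 nodes) has 10 > 3*5 - 6 = 9 edges,
# so it is rejected immediately.
k5_edges = [(i, j) for i in range(5) for j in range(i + 1, 5)]
print(may_be_planar(5, k5_edges))  # False
```

For graphs that pass this bound (e.g. K4, with 6 ≤ 3·4 − 6 = 6 edges), only a full planarity test can decide.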

Unit Tests

To ensure correctness and robustness, I have included 18 unit tests for the algorithm. However, given the complexity of the problem, I would be happy to add more test cases with known outcomes if provided or suggested.

Graph API Extension

I added a new method, sortNeighbors, to the Graph class. This method sorts the neighbors of a graph node based on a binary predicate, and I have included a corresponding unit test for this functionality. However, it’s possible that this could be achieved with existing APIs, and I would appreciate guidance if such an alternative already exists.
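To illustrate the idea behind such a comparator-based API (this is a hypothetical Python stand-in, not the actual `sortNeighbors` implementation from this PR), sorting one node's adjacency list with a binary predicate could look like:

```python
from functools import cmp_to_key

# Hypothetical stand-in for a sortNeighbors-style method: sort the
# adjacency list of one node using a binary predicate precedes(a, b)
# that returns True if a should come before b.

def sort_neighbors(adjacency: dict[int, list[int]], node: int, precedes) -> None:
    # Adapt the boolean predicate to a three-way comparator for sorting.
    cmp = lambda a, b: -1 if precedes(a, b) else (1 if precedes(b, a) else 0)
    adjacency[node].sort(key=cmp_to_key(cmp))

adj = {0: [3, 1, 2], 1: [0], 2: [0], 3: [0]}
sort_neighbors(adj, 0, lambda a, b: a < b)
print(adj[0])  # [1, 2, 3]
```

In the left-right test, a predicate like this is typically used to order each node's incident edges by their nesting depth before the embedding phase.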

Notes

This PR represents my first feature contribution to NetworKit. There is likely room for optimization, and I welcome any feedback or suggestions for improvement.

I look forward to your review and guidance to help refine this contribution.

@fabratu fabratu self-assigned this Jan 8, 2025
…e default constructor of Edge class constexpr.
Member

@angriman angriman left a comment


Thank you for your contribution! I left some comments, most of them are just minor style changes to be consistent with the rest of the codebase.

@Schwarf
Contributor Author

Schwarf commented Jan 19, 2025

Hi @angriman,

Thank you for your feedback! I’d like to provide some context and arguments for the choice of graph sizes in the tests and the use of many test cases:

  1. Multiple and Larger Graphs
    During the initial implementation, I relied on smaller, single-graph tests, which passed successfully. However, when I extended the tests to include many graphs of the same class (e.g., multiple grid graphs instead of a single 3x3 grid graph), some cases emerged, revealing bugs in the algorithm. Additionally, larger and random graphs helped me uncover issues that smaller graphs missed. These are the main reasons I included these tests.

  2. Algorithm extension
    I plan to extend my contribution by modifying the algorithm to provide planar embeddings for planar graphs. This may require minor refactoring of the current implementation. Similarly, potential performance improvements might also lead to refactoring. Having a variety of test cases (including larger and multiple graphs) will make it easier for me and others to ensure correctness during these changes.

  3. Negligible Runtime Cost
On my machine, the runtime difference between testing a single graph versus hundreds of graphs is on the order of a few milliseconds, which is negligible compared to tests that take up to 10 seconds. The same applies to the included large random graphs. While I understand the importance of keeping tests efficient ("Kleinvieh macht auch Mist"), I currently believe the benefits of including larger and multiple graphs outweigh the costs.

Given these reasons, I’d appreciate it if we could retain the current test cases in the file LeftRightPlanarityCheckGTest.cpp. Of course, I’m happy to hear your thoughts.

What do you think?

@fabratu
Member

fabratu commented Jan 20, 2025


There are positives and negatives to using randomly generated graphs for testing. In the past they have helped us discover some quite subtle bugs. However, sometimes these bugs can come from the generators themselves, which means a test implicitly checks the functionality of several different code units. Also, you cannot predict when these errors will turn up. Overall there are better systems for generating such tests, but those would require a complete pipeline overhaul.

We have had more bugs in algorithms / core functionalities that only use a pre-defined set of graphs, since these are often too narrow to cover all corner cases and are tailored around the original use cases of the algorithm. Therefore I would not drop the strategy of testing random graphs.

It is true, though, that this one algorithm incorporates a lot of generated graphs. While fast for now, running times can explode when new features are added or data structures change. I would therefore suggest dropping some redundancy / cases. As a rule of thumb, most algorithms don't test more than 10 different graphs.

@Schwarf
Contributor Author

Schwarf commented Jan 20, 2025


Hi @fabratu,

Thank you for your detailed feedback! I have to admit that I might not have fully grasped all the concerns raised by @angriman. Thank you for providing additional context and suggestions.

  1. I agree with your suggestion to reduce redundancy and limit the number of graphs tested for each type. I'll adjust the tests to include around 10 graphs per type and, in some cases, perhaps up to 20.
  2. Regarding the use of random graphs: these are programmatically generated. I will reduce their size from 50–100 nodes to a more manageable 10–20 nodes to ensure the tests remain efficient while still being diverse.

Before I proceed, is the above proposal acceptable for you guys?

@angriman
Member


I agree that testing on random graphs is very helpful for finding bugs in the algorithm you are developing (I've used them a lot in the past). However, I think that, when possible, they should not be part of unit tests because of the following reasons (more about testing best practices can be found in [1]):

  • Reproducibility: random tests can be flaky when tested on different architectures. It is important to have reproducible tests when possible.
  • If they fail, they don't help you understand what in the algorithm is not behaving as intended.

My recommendation is to test on a fixed and representative set of instances to evaluate that the algorithm behaves as intended. I understand that sometimes this is easier said than done because the algorithm may be complex. If that's the case, we can keep tests on random graphs.
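The reproducibility concern above can also be mitigated when random graphs are kept: fixing the generator's seed makes every run produce the same instance, so a failure is repeatable and debuggable. A minimal sketch (plain Python, not NetworKit's generators) of a seeded G(n, p)-style edge generator:

```python
import random

# Sketch: a reproducible "random" graph. With a fixed seed, the same
# edge set is produced on every run and on every platform that uses
# the same PRNG, so a failing test can be replayed exactly.

def gnp_edges(n: int, p: float, seed: int) -> list[tuple[int, int]]:
    rng = random.Random(seed)  # local RNG: no hidden global state
    return [(i, j) for i in range(n) for j in range(i + 1, n)
            if rng.random() < p]

# Two calls with the same seed yield the identical instance.
assert gnp_edges(10, 0.3, seed=42) == gnp_edges(10, 0.3, seed=42)
```

Note the caveat in the comment above: determinism across architectures holds only as long as the PRNG implementation itself is stable, which is one reason fixed instance files are still the safer default for unit tests.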

Regarding large graphs: it's ok to have individual tests on larger instances (you can also use those in the input/ directory in NetworKit). What I'm against is writing a unit test with 50+ lines used for graph edges, which makes it impossible for a future reader to understand what's going on. Why do you need to define such a large graph? What kind of behavior are you trying to test that cannot be tested with a smaller instance?

If you just need to test your algorithm on a larger graph, then my recommendation is to replace the custom graphs with 50+ nodes (testRandomGeneratedPlanarGraph50/51/100Nodes) with graphs in the input/ directory.

Let me know if you have further questions!

[1] https://abseil.io/resources/swe-book/html/ch12.html

@Schwarf
Contributor Author

Schwarf commented Jan 21, 2025


Thank you for the detailed explanation of your concerns. I considered the large graphs in the input folder before, but shied away from the effort of independently determining whether they are planar or not. I will go ahead and try, while removing the "random graphs" from the unit tests. For the "typed graphs" I will reduce the number of generated graphs as proposed above and reorganize the tests to focus on specific behaviors. Thank you for the link to the article. It was a great read and a helpful reminder of best practices.

@Schwarf Schwarf requested review from angriman and fabratu January 26, 2025 13:05
angriman
angriman previously approved these changes Jan 27, 2025
Member

@angriman angriman left a comment


Thank you for addressing the comments, I left a few minor ones, the rest LGTM.

@fabratu
Member

fabratu commented Jan 28, 2025

A side comment: the error in the pipeline for the aarch64 wheel build is fixed in PR #1283 and can be ignored with regard to the completion state of this PR.

@angriman
Member

There are still unresolved comments (e.g., std::ranges::is_sorted vs std::is_sorted), did you forget to push some commits?

@Schwarf
Contributor Author

Schwarf commented Jan 28, 2025

There are still unresolved comments (e.g., std::ranges::is_sorted vs std::is_sorted), did you forget to push some commits?

Sorry. They should all be marked as Outdated. I will go through and resolve them.
