Extended Presentation API Investigation

## Context

I'm working on frame pacing and we need some help from the api. The difficulty of designing this api is each WSI has different pieces of information and give it to us in different ways.

Supersedes #682 
Supersedes #2650

## Investigation

We have the following major WSIs to think about:

- IDXGISwapchain (Windows 7+ - D3D)
- IPresentationManager (Windows 11+ - D3D)
- CAMetalLayer (Mac - Metal)
- VK_GOOGLE_display_timing (Vulkan - Android)
- VK_KHR_present_wait (Vulkan - Nvidia)
- VK_KHR_incremental_present (Mainly Mesa/Android)
- VK_KHR_swapchain (All Vulkan)

And we have the following primatives:
- Get Present Start/End Time
- Wait for Present Finish
- Present with Damage
- Schedule Present Time
- Primary Monitor Frequency

<table>
<tr>
	<td>
	<td> Present Time
	<td> Wait for Present
	<td> Present with Damage
	<td> Scheduled Present
    <td> Monitor Frequency
<tr>
	<td> IDXGISwapchain
	<td> 🆗 (1a)
	<td> ✅<del>2</del>
	<td> ✅ (3)
	<td> 🆗 (4)
	<td> ✅
<tr>
	<td> IPresentationManager
	<td> ✅ (1b)
	<td> ✅ 
	<td> ❌ 
	<td> ✅ 
	<td> ❌ 
<tr>
	<td> CAMetalLayer
	<td> ✅ (1c)
	<td> ✅ 
	<td> ❌ 
	<td> ✅ 
	<td> ✅ (5)
<tr>
	<td> VK_GOOGLE_display_timing
	<td> ✅ 
	<td> ❌ 
	<td> ❌ 
	<td> ✅ 
	<td> ✅
<tr>
	<td> VK_KHR_present_wait
	<td> ❌ 
	<td> ✅ 
	<td> ❌ 
	<td> ❌ 
	<td> ❌ 
<tr>
	<td> VK_KHR_incremental_present
	<td> ❌ 
	<td> ❌ 
	<td> ✅ 
	<td> ❌ 
	<td> ❌ 
<tr>
	<td> VK_KHR_swapchain
	<td> ❌ 
	<td> ❌ 
	<td> ❌
	<td> ❌ 
	<td> ❌ 
</table>

Notes:
1a. Presentation times need to be queried actively, it doesn't get told to us.
1b. Presentation times are given through an event queue.
1c. Presentation times are given through callbacks.
~2. Can only wait for 1-3 frames ago, not a particular frame.~
3. Windows 8+/Windows 7 Platform Update
4. You can schedule presentation for N vblanks from the previous presentation.
5. Via NSScreen - need to figure out how to get NSScreen from metal layer.

Because of the diversity of the platforms, I think this will inherently be a leaky abstraction - this is okay - we shouldn't try to hide platform differences, just make it as easy to use as possible.

As such I have put together the following api.

## Api Suggestion

### Feature

First is to add a single Feature.

```rust
const EXTENDED_PRESENTATION_FEATURES = ...;
```

### Presentation Features

Add an extended presentation capabilities bitflag that is queryable from the surface. I am separating this from regular features because they are more useful as default-on. Having the single feature means that users have to consciously enable it, but without needing to individually modulate them.

```rust
fn Surface::get_extended_presentation_features(&self, &Adapter) -> ExtendedPresentationFeatures;

bitflags! {
    // Names bikeshedable
    struct ExtendedPresentationFeatures {
        const PRESENT_STATISTICS = 1 << 0;
        const MONITOR_STATISTICS = 1 << 1;
        const WAIT_FOR_PRESENTATION = 1 << 2;
        const PRESENT_DAMAGE_REGION = 1 << 3;
        const PRESENT_DAMAGE_SCOLL = 1 << 4;
        const PRESENT_TIME = 1 << 5;
        const PRESENT_VBLANK_COUNT = 1 << 6;
    }
}
```

### Presentation Signature

The presentation signature will be changed to the following.

```rust
fn Surface::present(desc: PresentationDescriptor<'a>);

#[derive(Default)] // Normal presentations will be PresentationDescriptor::default()
struct PresentationDescriptor<'a> {
    // Must be zero-length if PRESENT_DAMAGE_REGION is not true
    rects: &'a [Rect],
    // Must be None if PRESENT_DAMAGE_SCOLL is not true
    scroll: Option<PresentationScoll>,
    // Must be NoDelay if PRESENT_TIME or PRESENT_VBLANK_COUNT if not true
    presentation_delay: PresentationDelay,
}

struct PresentationScroll {
    source_rect: Rect,
    offset: Vec2,
}

struct Rect {
    offset: Vec2,
    size: Vec2,
}

enum PresentationDelay {
    // Queue the frame immediately. 
    NoDelay,
    // Queue the frame for N vblanks from now (must be between 1 and 4). Needs PRESENT_VBLANK_COUNT.
    ScheduleVblank(u8)
    // Queue the frame for presentation at the given time. Needs PRESENT_TIME.
    ScheduleTime(PresentationTime)
}
```

### Presentation Timestamp

Because different apis use different timestamps - we need a way of correlating these timestamps with various other clocks. The clocks used are as follows on each WSI:

<table>
<tr>
	<td>WSI
	<td>Clock
<tr>
	<td>IDXGISwapchain
	<td><a href="https://docs.microsoft.com/en-us/windows/win32/api/profileapi/nf-profileapi-queryperformancecounter">QueryPerformanceCounter</a>
<tr>
	<td>IPresentationManager
	<td><a href="https://docs.microsoft.com/en-us/windows/win32/api/realtimeapiset/nf-realtimeapiset-queryinterrupttimeprecise">QueryInterruptTimePrecise</a>
<tr>
	<td>CAMetalLayer
	<td><a href="https://developer.apple.com/documentation/kernel/1462446-mach_absolute_time">mach_absolute_time</a>
<tr>
	<td>VK_GOOGLE_display_timing
	<td><a href="https://linux.die.net/man/3/clock_gettime">clock_gettime(CLOCK_MONOTONIC)</a>
</table>

Add the following function to the surface.

```rust
fn Surface::correlate_presentation_timestamp<F, T>(&self, &Adapter, F)  -> (PresentationTimestamp, T) where FnOnce() -> T;

// Unit: nanoseconds
struct PresentationTimestamp(pub u64);
```

Which will let people write the following code to correlate instants and presentation timestamps. We need this because `Instant`s need to be treated as completely opaque as the clock they use can change at any time. In most cases these are actually the same clock, but this is what we get.

```rust
let (present_timestamp, now) = surface.correlate_presentation_timestamp(&adapter, Instance::now);
```

### Presentation Statistics

Because of the difference in how all the apis query stats, we need to abstract this carefully. We use a query-based "presentation statistics queue".

- CAMetalLayer: Callbacks will save the time into a queue, which is emptied every time it is queried.
- IPresentationManager: Calling the query function drains the statistics queue.
- IDXGI: Query calls GetPresentationStatistics and returns a single value.
- VK_GOOGLE_present_timing: Calls `vkGetPastPresentationTimingGOOGLE` which drains the queue.

```rust
fn Surface::query_presentation_statistics(&self, &Device) -> Vec<PresentationStatistics>;

struct PresentationStatistics {
    presentation_start: PresentationTimestamp,
    // Only available on IPresentationManager
    presentation_end: Option<PresentationTimestamp>,
    // Only available on VK_GOOGLE_display_timing
    earliest_present_time: Option<PresentationTimestamp>,
    // Only available on VK_GOOGLE_display_timing
    presentation_margin: Option<PresentationTimestamp>,
    composition_type: CompositionType,
}

enum CompositionType {
    // CAMetalLayer is always Composed
    Composed,
    Independent,
    // Vulkan, DXGI is always unknown
    Unknown,
}
```

### Presentation Wait

First add the following member to SurfaceConfiguration:

```rust
// Requires WAIT_FOR_PRESENTATION and must be between 1 and 2.
maximum_latency: Option<u8>
```

This adjusts either the swapchain frame count to `value + 1` - or sets SetMaximumFrameLatency to the value given, or uses a wait-for-present in the acquire method to limit rendering such that it acts like it's a `value + 1` swapchain frame set.

### Monitor Information

Getting exact frequencies of monitors is important for pacing - they can be derived from presentation stats, but an explicit api is more precise if it is available.

```rust
fn Surface::query_monitor_statistics(&self, &Device) -> MonitorStatistics;

struct MonitorStatistics {
    // In nanoseconds
    min_refresh_interval: u64,
    max_refresh_interval: u64,
    // On available on CAMetalLayer
    display_update_granularity: u64,
}
```

## Conclusion

This is obviously one hell of an api change, and this doesn't have to happen all at once, but this investigating should give us the place to discuss the changes and make sure it provides the information needed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extended Presentation API Investigation #2869

Context

Investigation

Api Suggestion

Feature

Presentation Features

Presentation Signature

Presentation Timestamp

Presentation Statistics

Presentation Wait

Monitor Information

Conclusion

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

	Present Time	Wait for Present	Present with Damage	Scheduled Present	Monitor Frequency
IDXGISwapchain	🆗 (1a)	✅2	✅ (3)	🆗 (4)	✅
IPresentationManager	✅ (1b)	✅	❌	✅	❌
CAMetalLayer	✅ (1c)	✅	❌	✅	✅ (5)
VK_GOOGLE_display_timing	✅	❌	❌	✅	✅
VK_KHR_present_wait	❌	✅	❌	❌	❌
VK_KHR_incremental_present	❌	❌	✅	❌	❌
VK_KHR_swapchain	❌	❌	❌	❌	❌

WSI	Clock
IDXGISwapchain	QueryPerformanceCounter
IPresentationManager	QueryInterruptTimePrecise
CAMetalLayer	mach_absolute_time
VK_GOOGLE_display_timing	clock_gettime(CLOCK_MONOTONIC)

Extended Presentation API Investigation #2869

Description

Context

Investigation

Api Suggestion

Feature

Presentation Features

Presentation Signature

Presentation Timestamp

Presentation Statistics

Presentation Wait

Monitor Information

Conclusion

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions