Questions about Stream level consumption model

Issue is to open-up a discussion around:
1. For which use-cases does it make sense to use the stream level consumption model in Pinot ?
2. What are the semantics offered by the stream level consumption model. Eg. how does data from the source get partitioned into Pinot tables ? How is the consumption monitored in this model?  Iiuc, segment name convention is also different? 
3.  Some feature differences I have noticed are (please correct, if I am mistaken). I am sure there are more. 

 Feature | HLC | LLC 
---|---|---
Force commit | No | Yes 
Stream Message metadata extraction | No (can potentially be extended) | Yes 
Ingestion throttling | No | Yes

4. Documentation is sparse about this usage and its guarantees. Iirc, there were a few examples in OSS documentation which used high level consumer. Users have mistakenly used these samples with `ConsumerType.HIGHLEVEL` and ended up in long debugging sessions. One example is https://apache-pinot.slack.com/archives/CDRCA57FC/p1687987849496959?thread_ts=1687912445.703689&cid=CDRCA57FC. (when the original incident happened, we spent ~1-2 days debugging before realizing that the stream type is high level)
5. Known issues:
* Deleting a high level table does not clean up the ZK metadata (ideal state & segment store)

I would like to propose that we find a path to sunset the stream level consumption model. but I don't want to proceed without understanding the above questions. Please help clarify. 

I also see comments like "This can be removed once we remove HLC implementation from the code" [link](https://github.com/apache/pinot/blob/master/pinot-spi/src/main/java/org/apache/pinot/spi/stream/PartitionLevelStreamConfig.java#L30)  . So, I am assuming this topic has come up before for discussion :) 



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Questions about Stream level consumption model #10996

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Feature	HLC	LLC
Force commit	No	Yes
Stream Message metadata extraction	No (can potentially be extended)	Yes
Ingestion throttling	No	Yes

Questions about Stream level consumption model #10996

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions