-
Notifications
You must be signed in to change notification settings - Fork 46
Closed
Labels
enhancementNew feature or requestNew feature or request
Description
Search before asking
- I had searched in the issues and found no similar feature requirement.
Problem Description
Flink applications maintained by Streamis may lose job status (data) when they fail to automatically pull up or stop active pull up. Checkpoint files are required to save and recover job status. Therefore, methods to obtain checkpoint file information are required.
Streamis维护的Flink应用作业在失败自动拉起和停止主动拉起的过程中,可能会发生作业状态(数据)丢失的现象,需要借助checkpoint文件来对作业的状态进行保存和恢复,所以要提供获取checkpoint文件信息的方法。
Description
- Use Linkis HttpClient, GetDirFileTrees method under FsRestfulApi entry class of linkis-storage-script-dev-server module of Linkis; Construct the corresponding client Request and Response. On the Streamis side, the default checkpoint/savepoint file root directory should be saved in configuration files. After concatenating the file with the task name, the checkpoint/savepoint root directory of the task can be obtained and used as the request parameter. Call the getDirFileTrees method to get the subdirectory tree under that directory.
- Obtain a list of qualified subdirectories based on the subdirectory tree. Use the latest subdirectory as the recovery directory.
1 使用Linkis的HttpClient,对应Linkis的linkis-storage-script-dev-server模块的FsRestfulApi入口类下的getDirFileTrees方法,构建相应的客户端Request和Response。Streamis侧应该通过配置文件等形式保存默认的checkpoint/savepoint文件根目录,与具体的任务名称做拼接后,得到该任务的checkpoint/savepoint根目录,并以此为请求参数入参,调用getDirFileTrees方法,获得该目录下的子目录树。
2 根据子目录树,拿到符合要求的子目录列表,取最新的子目录作为恢复目录,该子目录的信息即为最新的checkpoint文件信息。
Use case
No response
solutions
No response
Anything else
No response
Are you willing to submit a PR?
- Yes I am willing to submit a PR!
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request