使用tispark+structured streaming执行aggregations操作时需保存state,无法为checkpoint指定“This checkpoint location has to be a path in an HDFS compatible file system”导致在程序运行时报错。
目前我们分析环境tispark+tidb,没有hdfs或类似的分布式文件系统,有没有其他解决方案。
ps:网上有提到使用OSS存储,单目前内网开发环境,无法联网,而且oss付费系统不太合适
0/03/26 18:57:49 WARN TaskSetManager: Lost task 0.0 in stage 1.0 (TID 1, 192.168.10.203, executor 1): java.lang.IllegalStateException: Error reading delta file file:/data/checkpoint/test/state/0/0/1.delta of HDFSStateStoreProvider[id = (op=0,part=0),dir = file:/data/checkpoint/test/state/0/0]: file:/data/checkpoint/test/state/0/0/1.delta does not exist