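I have worked through the data flow tutorial. The sink currently creates 4 files in Azure Data Lake Gen2. I suspect this is related to the HDFS file system.
Is it possible to save the output without the success, committed, and started files?
What is the best practice? Should they be deleted after saving to Data Lake Gen2, or are they needed in further data processing?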
https://docs.microsoft.com/en-us/azure/data-factory/tutorial-data-flow
Posted on 2021-09-27 07:11:59
There are a couple of options available.
- Select [**Output to single file**](https://docs.microsoft.com/en-us/azure/data-factory/data-flow-sink#sink-settings) from the dropdown of the file name option and give the output file name.
- You could also parameterize the output file name as required. Refer to this [SO](https://stackoverflow.com/questions/61123171/how-to-set-the-file-name-based-on-a-parameter-in-the-sink) thread.

https://stackoverflow.com/questions/69311896