Expand description
webhdfs
sink.
The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware. HDFS consists of a namenode and a datanode. We will send rpc to namenode to know which datanode to send and receive data to. Also, HDFS will rebalance data across the cluster to make sure each file has enough redundancy.
┌───────────────┐
│ Data Node 2 │
└───────────────┘
▲
┌───────────────┐ │ ┌───────────────┐
│ Data Node 1 │◄──────────┼───────────►│ Data Node 3 │
└───────────────┘ │ └───────────────┘
┌───────┴───────┐
│ Name Node │
└───────────────┘
▲
│
┌──────┴─────┐
│ Vector │
└────────────┘
WebHDFS will connect to the HTTP RESTful API of HDFS.
For more information, please refer to:
webhdfs
is an OpenDal based services. This mod itself only provide
config to build an crate::sinks::opendal_common::OpenDalSink
. All real implement are powered by
crate::sinks::opendal_common::OpenDalSink
.
Structs§
- Configuration for the
webhdfs
sink.