[[modules-gateway-hadoop]]
=== Hadoop Gateway

*The hadoop gateway is deprecated and will be removed in a future
version. Please use the
<<modules-gateway-local,local gateway>>
instead.*

The hadoop (HDFS) based gateway stores the cluster metadata and indices
data in hadoop. Hadoop support is provided as a plugin. To install it,
either follow the instructions
https://github.com/elasticsearch/elasticsearch-hadoop[here], or download
the hadoop plugin and place it under the `plugins` directory. Here is an
example config to enable it:

[source,js]
--------------------------------------------------
gateway:
    type: hdfs
    hdfs:
        uri: hdfs://myhost:8022
--------------------------------------------------

[float]
==== Settings

The hadoop gateway requires two settings. The `gateway.hdfs.uri` setting
controls the URI used to connect to the hadoop cluster, for example:
`hdfs://myhost:8022`. The `gateway.hdfs.path` setting controls the path
under which the gateway will store the data.
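
As a minimal sketch, the two settings could be combined in a single
config as follows. The path value `/elasticsearch/gateway` is a
hypothetical placeholder chosen for illustration, not a documented
default:

[source,js]
--------------------------------------------------
gateway:
    type: hdfs
    hdfs:
        # URI of the hadoop cluster to connect to
        uri: hdfs://myhost:8022
        # hypothetical path; replace with the HDFS directory
        # the gateway should store its data under
        path: /elasticsearch/gateway
--------------------------------------------------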

[float]
==== concurrent_streams

The `gateway.hdfs.concurrent_streams` setting throttles the number of
streams (per node) opened against the shared gateway while performing
the snapshot operation. It defaults to `5`.
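
For example, a config along the following lines would lower the limit
below the default. The value `2` is an arbitrary illustration, not a
recommendation:

[source,js]
--------------------------------------------------
gateway:
    type: hdfs
    hdfs:
        uri: hdfs://myhost:8022
        # illustrative value only; the default is 5
        concurrent_streams: 2
--------------------------------------------------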