Elasticsearch#

Elasticsearch provides enterprise-scale deployments with optimized search performance, dedicated indexing and usage resourcing via cluster support without performance degradation and timeouts, resulting in faster, more predicable search results. Mattermost’s implementation uses Elasticsearch as a distributed, RESTful search engine supporting highly efficient database searches in a cluster environment.

Important

The default Mattermost database search starts to show performance degradation at around 2 million posts, on a server with 32 GB RAM and 4 CPUs. If you anticipate your Mattermost server reaching more than 2.5 million posts, we recommend enabling Elasticsearch for optimum search performance before reaching 3 million posts.
For deployments with over 5 million posts, Elasticsearch is required to avoid significant performance issues (such as timeouts) with search and at-mentions.
We highly recommend that you set up Elasticsearch on a different machine than the Mattermost Server.

Deployment guide#

Elasticsearch allows you to search large volumes of data quickly, in near real-time, by creating and managing an index of post data. The indexing process can be managed from the System Console after setting up and connecting an Elasticsearch server. The post index is stored on the Elasticsearch server and updated constantly after new posts are made. In order to index existing posts, a bulk index of the entire post database must be generated.

Deploying Elasticsearch includes the following two steps: setting up the Elasticsearch server, and configuring Elasticsearch in Mattermost.

Set up an Elasticsearch server#

Download and install the latest release of Elasticsearch v8, or Elasticsearch v7.17+. See the Elasticsearch documentation for installation details.

Important

Mattermost v9.11 adds support for Elasticsearch v8 as well as beta support for Opensearch v1.x and v2.x.
Mattermost supports Elasticsearch v7.17+. However, we recommend upgrading your Elasticsearch v7 instance to v8.x. See the Elasticsearch upgrade documentation for details.
When using Elasticsearch v8, ensure you set action.destructive_requires_name to false in elasticsearch.yml to allow for wildcard operations to work.

AWS Elasticsearch

If you’re using AWS Elasticsearch, you must:

Upgrade to AWS Opensearch. Refer to the Opensearch upgrade documentation for details.

Disable “compatibility mode” in Opensearch.

Upgrade Mattermost server.

Update the Mattermost ElasticsearchSettings.Backend configuration setting value from elasticsearch to `opensearch` manually or using mmctl. This value cannot be changed using the System Console. See the Mattermost Elasticsearch backend type documentation for additional details.

Restart the Mattermost server.

Set up Elasticsearch with systemd by running the following commands:

sudo /bin/systemctl daemon-reload
sudo /bin/systemctl enable elasticsearch.service
sudo systemctl start elasticsearch.service

Confirm Elasticsearch is working on the server by running the following command:

curl localhost:9200

Get your network interface name by running the following command:

ip addr

Edit the Elasticsearch configuration file in vi by running the following command:

vi /etc/elasticsearch/elasticsearch.yml

In this file, replace the network.host value of _eth0_ with your network interface name, and save your changes.
Restart Elasticsearch by running the following commands:

sudo systemctl stop elasticsearch
sudo systemctl start elasticsearch

Confirm the ports are listenings by running the following command:

netstat -plnt
You should see the following ports, including the ones listening on ports 9200 and 9300. Confirm these are listening on your server’s IP address.

Create an Elasticsearch directory and give it the proper permissions.
Install the icu-analyzer plugin to the /usr/share/elasticsearch/plugins directory by running the following command:

sudo /usr/share/elasticsearch/bin/elasticsearch-plugin install analysis-icu

Test the connection from Mattermost to Elasticsearch by running the following command:

curl 172.31.80.220:9200

Configure Elasticsearch in Mattermost#

Follow these steps to configure Mattermost to use your Elasticsearch server, and to generate the post index.

Tip

Advanced Elasticsearch configuration settings for large deployments can be configured outside the System Console in the config.json file. See the Elasticsearch configuration settings documentation to learn more.

Go to System Console > Environment > Elasticsearch.
Set Enable Elasticsearch Indexing to true to enable the other the settings on the page. Once the configuration is saved, new posts made to the database are automatically indexed on the Elasticsearch server.
Set the Elasticsearch server connection details:

Enter Server Connection Address for the Elasticsearch server you set up earlier.

(Optional) Enter Server Username used to access the Elasticsearch server. For AWS Elasticsearch, leave this field blank.

(Optional) Enter Server Password associated with the username. For AWS Elasticsearch, leave this field blank.

Set Enable Cluster Sniffing (Optional). Sniffing finds and connects to all data nodes in your cluster automatically. For AWS Elasticsearch, this field should be set to false.

Tip

From Mattermost v7.8, optional CA and client certificate configuration settings are available for use with basic auth credentials or to replace them. See the Elasticsearch configuration settings documentation for details.

Select Test Connection and then select Save. If the server connection is unsuccessful you won’t be able to save the configuration or enable searching with Elasticsearch.
Select Build Index to build the post index of existing posts. This process can take up to a few hours depending on the size of the post database and number of messages. The progress percentage can be seen as the index is created. To avoid downtime, set Enable Elasticsearch for search queries to false so that database search is available during the indexing process.

Important

Complete bulk indexing before enabling Elasticsearch in the next step. Otherwise, search results will be incomplete.

Enable Elasticsearch by setting Enable Elasticsearch for search queries to true, and setting Enable Elasticsearch for autocomplete to true.
Save your configuration updates and restart the Mattermost server.

Note

Additional advanced Elasticsearch settings for large deployments can be configured outside the System Console in the config.json file. Read the Elasticsearch configuration settings documentation to learn more.
If your deployment has a large number of posts (typically in excess of one million but not strictly defined), the reindexing progress percentage may stay at 99% for a long time. The size of the data to be indexed is estimated, and on large databases, estimations can become inaccurate. While progress estimates may be inaccurate, and the progress percentage may appear stuck at near completion, indexing will continue behind the scenes until complete.
Search results for files shared before upgrading to Mattermost Server v5.35 may be incomplete until an extraction command is run using the mmctl. After running this command, the search index must be rebuilt. Go to System Console > Environment > Elasticsearch > Bulk Indexing, then select Index Now to rebuild the search index to include older file contents.
For high post volume deployments, we strongly encourage you to read and properly configure the Mattermost LiveIndexingBatchSize configuration setting.

Limitations#

Elasticsearch uses a standard selection of “stop words” to keep search results relevant. Results for the following words will not be returned: “a”, “an”, “and”, “are”, “as”, “at”, “be”, “but”, “by”, “for”, “if”, “in”, “into”, “is”, “it”, “no”, “not”, “of”, “on”, “or”, “such”, “that”, “the”, “their”, “then”, “there”, “these”, “they”, “this”, “to”, “was”, “will”, and “with”.
Searching stop words in quotes returns more results than just the searched terms (ticket).
Search results are limited to a user’s team and channel membership. This is enforced by the Mattermost server. The entities are indexed in Elasticsearch in a way that allows Mattermost to filter them when querying, so the Mattermost server narrows down the results on every Elasticsearch request applying those filters.

Frequently asked questions (FAQ)#

Do I need to use Elasticsearch?#

The Elasticsearch engine is designed for large Enterprise deployments to run highly efficient database searches in a cluster environment. The default Mattermost database search starts to show performance degradation at around 2.5 million posts, depending on the specifications for the database server. If you expect your Mattermost server to have more than 2.5 million posts, we recommend using Elasticsearch for optimum search performance.

Should I install Elasticsearch on the same machine as Mattermost Server?#

No. We strongly recommend that you install Elasticsearch on a different machine than the Mattermost Server.

What types of indexes are created?#

Mattermost creates three types of indexes: users, channels, and posts. Users and channels have one index each. Posts are aggregated by date, into multiple indexes.

Can I pause an Elasticsearch indexing job?#

Yes. From Mattermost v6.7, the Elasticsearch indexing job is resumable. Stopping a server while the Elasticsearch indexing job is running puts the job in pending status. The job resumes when the server restarts. System admins can cancel an indexing job through the System Console.

Can an index rollover policy be defined?#

The AggregatePostsAfterDays configuration setting defines a cutoff value. All posts preceding this value are reindexed and aggregated into new and bigger indexes. The default setting is 365 days.

Are there any new search features offered with Elasticsearch?#

The current implementation of Elasticsearch matches the search features currently available with database search. The Mattermost team is working on extending the Elasticsearch feature set with file name and content search, date filters, and operators and modifiers.

Are my files stored in Elasticsearch?#

No, files and attachments are not stored.

How do I monitor system health of an Elasticsearch server?#

You can use this Prometheus exporter to monitor various metrics about Elasticsearch: justwatchcom/elasticsearch_exporter.

You can also refer to this article about Elasticsearch performance monitoring. It’s not written specifically for Prometheus, which Mattermost’s performance monitoring system uses, but has several tips and best practices.

What form of data is sent to Elasticsearch?#

Mattermost communicates with Elasticsearch through its REST API using JSON messages for indexing and querying entities.

How much data is sent to Elasticsearch and when?#

Every time a message is published, a channel is created, or a user changes, (either because their properties change e.g.: change of the first name or because they join/leave a channel), the data associated with that event is sent to Elasticsearch.

If search via Elasticsearch is enabled, every search will generate a query. If autocompletion is enabled, every user or channel autocompletion associated with writing a message or user search will generate a query.

How do I know if an Elasticsearch job fails?#

Mattermost provides the status of each Elasticsearch indexing job in System Console > Environment > Elasticsearch. Here you can see if the job succeeded or failed, including the details of the error.

Failures are returned in the server logs. The error log begins with the string Failed job and includes a job_id key/value pair. Elasticsearch job failures are identified with worker name EnterpriseElasticsearchAggregator and EnterpriseElasticsearchIndexer. You can optionally create a script that programmatically queries for such failures and notifies the appropriate system.

My Elasticsearch indexes won’t complete, what should I do?#

If you have an Elasticsearch indexing job that’s paused, it’s likely your Elasticsearch server has restarted. If you restart your Elasticsearch server, you must also restart Mattermost to ensure jobs are completed. If restarting the Mattermost server does not resolve the issue, contact Mattermost Support.

Do I need to purge first then bulk index each time?#

Yes.

Required Permissions For Mattermost Service Account#

In “least privilege” environments you may need to further constrain the service account permissions to limit the access your Elasticsearch service account has. The following JSON provides an example of a “least privilege” permission set that allows Mattermost to operate correctly with Elasticsearch:

{
  "cluster_permissions": [
    "cluster:monitor/*",
    "indices:admin/template/put",
    "indices:data/write/bulk"
  ],
  "index_permissions": [
    {
      "index_patterns": [
        "\<IndexPrefix\>*"
      ],
      "allowed_actions": [
        "indices:admin/get",
        "indices:admin/create",
        "indices:admin/delete",
        "indices:admin/mapping/put",
        "indices:admin/mappings/fields/get*",
        "indices:data/read*",
        "indices:data/write*"
      ]
    }
  ]
}

A simpler, more flexible, and resilient variant of the above would be:

{
  "cluster_permissions": [
    "cluster:monitor/*",
    "indices:admin/template/put",
    "indices:data/write/bulk"
  ],
  "index_permissions": [
    {
      "index_patterns": [
        "\<IndexPrefix\>*"
      ],
      "allowed_actions": [
        "indices:*"
      ]
    }
  ]
}