February 17, 2017
10 min read time

Varnish Web Developer Wiki Highlights: Drupal with Varnish step-by-step

It will probably not come as a big surprise to anyone that some of the world's most popular and powerful content management systems - as great as they are - can use a little bit of help when it comes to performance. It will also not be surprising to many that the flexibility of Varnish Cache makes it an ideal complement to these CMSes in boosting performance. Last week, we shared the tutorial on using Varnish with WordPress via the Varnish Developer Wiki... and this week, we bring you another tutorial on another major CMS: straight from the Varnish with Drupal section please read the step-by-step guide to make Drupal 8 site fly with Varnish.

It's a wiki, meaning that we hope and expect that you will help us to expand it and keep it up to date. I'm publishing it here in the blog for reference. In this tutorial, we give the step-by-step process that will help you install and configure Varnish to take your Drupal-based site to the next level. So let's get started.

This article assumes that you have a running instance of Drupal 8 and that you have administrator rights for said instance, both at the OS and application level. We have tested this using Ubuntu LTS 16.04, Varnish Cache 4.1 and Drupal 8.

If you still need help Installing Drupal 8 visit the Drupal-Site

1. Installing Varnish

In case you also do not have Varnish, you will need to follow the instructional section on how to Install Varnish before we can continue.

2. How to place Drupal 8 behind Varnish

Now that you have set up Varnish in-front of your Drupal 8 installation, and have apache2 configured, you need to know how to configure Drupal to purge cached content. By default anonymous page caching is enabled.

To configure caching in Drupal, log in as an administrator on your Drupal site.

  • Go to the Configuration Menu
  • Click on Performance
  • Locate Caching and check both the boxes
    x Cache pages for anonymous users (checked by default) x Cache blocks
  • Set the Minimum cache lifetime; ~ 60min
  • Set Expiration of cached pages; ~ 60min

For Drupal’s Performance settings go to /admin/config/development/performance.

Set the value for ‘Page cache maximum age’ as shown below:

Sphinx Neo-Hittite

Always choose caching time keeping in mind both better site performance and the need to ensure that the cache is not stale for too long. This value will solely depend on the type of site you have, its content and what it purpose it serves.

Drupal does two things:

  1. It sends the Purge-Cache-Tags header with every request,
    containing a space-separated list of all the page’s cache tags.
  2. It also sends a BAN request with the appropriate cache tags whenever content
    or configuration is updated that should expire pages with the associated cache tags.

Both of these can be achieved quickly and easily by enabling and configuring the Purge and Generic HTTP Purger modules. Read about the suggested plugins on on our main page.

Next you need to add a ‘purger’ that will send the appropriate BAN requests using purge_purger_http: visit the Purge configuration page, admin/config/development/performance/purge,

Then follow the steps below as in the image:

  1. Add a new purger
Sphinx Neo-Hittite
  1. Choose ‘HTTP Purger’ and click ‘Add’:
Sphinx Neo-Hittite

iii. Configure the Purger’s name (“Varnish Purger”), Type (“Tag”), and Request settings (defaults for Drupal VM are hostname 127.0.0.1, port 81, path /, method BAN, and scheme http):

Sphinx Neo-Hittite

iv. Configure the Purger’s headers (add one header Purge-Cache-Tags with the value [invalidation:expression]):

Sphinx Neo-Hittite

Note from the Original Author: Don’t use the header in the screenshot—use Purge-Cache-Tags!

Images and textual courtesy: Jeff Geerling’s Post on Drupal 8 and Varnish

  • Lastly Save the configuration and test that Varnish is working.
  • Then move on to more advanced stuff; personalized caching is a recommendation.

This is a basic configuration for Varnish and Drupal; to go into more depth, use VCL to write your own customized code. This wiki contains some templates and examples.

3. Caching

Varnish caches everything, so you need to write a rule to exclude what you do not want to cache.

By default, Varnish caches two types of requests: GET and HEAD. Other requests like DELETE, POST and PUT are never cached. That means you do not have to worry about requests that make changes to data because they are allowed to get to the application.

4. Excluding URLS

Pages protected using HTTP Authorization are never cached. For your application-specific mechanisms, you need to add a rule such as the following to ensure that login pages aren’t cached.

# exclude drupal login url from caching

  if (req.url ~ "^/status\.php$" ||
     req.url ~ "^/update\.php" ||
     req.url ~ "^/install\.php" ||
     req.url ~ "^/admin" ||
     req.url ~ "^/admin/.*$" ||
     req.url ~ "^/user" ||
     req.url ~ "^/user/.*$" ||
     req.url ~ "^/users/.*$" ||
     req.url ~ "^/info/.*$" ||
     req.url ~ "^/flag/.*$" ||
     req.url ~ "^.*/ajax/.*$" ||
     req.url ~ "^.*/ahah/.*$") {
     return (pass);
  }

If we did end up caching login pages, we could end up serving the same content to all users, which brings us to our next topic: Cookies!

5. Cookies

Cookies are everywhere these days! And we need some of them. But they are also one of the most important things in the caching decision. Making a choice between which cookies to cache or include is very important for a web application.

Examples such as your site statistics analysis or your website indexing require cookies too. But these are not used by your Drupal site at all but if they didn’t exist on your site, your web content wouldn’t be indexed for searches. Either way these cookies make your site’s content uncacheable and therefore you as the developer have to make caching choices very carefully.

On the other hand, cookies related to page designs and other static content need to be allowed to cache. Below is an example of caching cookies for your Drupal site:

#Cookie example
#Collected from: https://fourkitchens.atlassian.net/wiki/display/TECH/Configure+Varnish+3+for+Drupal+7

# this is an example of Varnish 3, needs to be tested for varnish 4

sub vcl_recv {

if (req.http.Cookie) {
    # removing these styling and photo cookes from here will allow it to be cached
    if (req.url ~ "(?i)\.(css|js|jpg|jpeg|gif|png|ico)(\?.*)?$") {
        unset req.http.Cookie;
    }

    set req.http.Cookie = ";" + req.http.Cookie;
    set req.http.Cookie = regsuball(req.http.Cookie, "; +", ";");
    set req.http.Cookie = regsuball(req.http.Cookie, ";(SESS[a-z0-9]+|SSESS[a-z0-9]+|NO_CACHE)=", "; \1=");
    set req.http.Cookie = regsuball(req.http.Cookie, ";[^ ][^;]*", "");
    set req.http.Cookie = regsuball(req.http.Cookie, "^[; ]+|[; ]+$", "");

    if (req.http.Cookie == "") {
      # If there are no remaining cookies, remove the cookie header. If there
      # aren't any cookie headers, Varnish's default behavior will be to cache
      # the page.
      unset req.http.Cookie;
    }
    else {
      # If there is any cookies left (a session or NO_CACHE cookie), do not
      # cache the page. Pass it on to Apache directly.
      return (pass);
    }
}

# removing cookies for static files

sub vcl_backend_response {
    # Remove cookies for stylesheets, scripts and images used throughout the site.
    # Removing cookies will allow Varnish to cache those files. It is uncommon for
    # static files to contain cookies, but it is possible for files generated
    # dynamically by Drupal. Those cookies are unnecessary, but could prevent files
    # from being cached.
    if (bereq.url ~ "(?i)\.(css|js|jpg|jpeg|gif|png|ico)(\?.*)?$") {
        unset beresp.http.set-cookie;
    }
}

6. Drupal Caching Headers

Drupal sends its own caching information in response headers just like many other web applciations. These headers are obviously important to your web application and if you configure your Varnish to never cache any response, this could destabilize your web application. So you need add some configurations to your VCL code that will cache your Drupal header responses but not cache other headers.

7. Purging

This bit of code is to govern which IP addresses can access the config files.

acl internal {
  "192.x.x.x"/24;
  xxx.xxx.xx.xx;
}
# Allowing which address can access cron.php or install.php,
# add the following in acl.

8. Restart services after making changes

Don’t forget to restart after making changes:

sudo systemctl restart varnish.service

sudo systemctl restart apache2.service

9. Go further

If you are interested in Varnish, you can always give Varnish Plus a go with a free trial. You can capture real-time traffic statistics, create a paywall for premium content, simultaneously work on administration across all Varnish servers, record relationships between web pages for easy content maintenance, detect devices used for browsing, accelerate APIs and more.

Check out the Varnish Wiki

Photo (c) 2012 David used under Creative Commons license.