Caching Without Compromises and Spring

  • 7 April 2019
  • 7 minutes to read
  • Share on:

Caching is important to build efficient software. And there is a large variety of excellent caching libraries in Java that will help you with it. However, in Spring-based applications, all of them are primarily used indirectly via Spring caching abstractions. If you go through any Spring tutorial on caching you will be easily convinced that adding a cache to a Spring app is an absolute no-brainer.

Want to cache a result of the method invocation?
Annotate this method with @Cacheable and Spring will do the rest.
– Author X, Every Spring Tutorial

But when it comes to a slightly non-trivial caching problem or when the application is being heavily refactored, Spring force is leaving you and everything breaks apart. Let’s see why it happens and what are the alternatives.

Caching with Spring by Example

We’ll explore and challenge Spring caching while implementing a very primitive wrapper for the async HTTP client that caches responses in Redis.

Let’s test the caching mechanism with a simple ConcurrentMap in-memory cache to verify that responses are actually being cached per a combination of a request method and URL.

For the purpose of testing, we’ll use HttpClient implementation that returns a response containing a randomly generated string.

Here is the config.

And here is the test.

Test results are successful as we expected. We already got some confidence, let’s just replace ConcurrentMap with Redis and ship the code to production.

The only thing that needs to change, as per Spring documentation, is the configuration.
With latest Spring Boot it should work like magic, but let’s look under the hood and configure the RedisCacheManager ourselves.

In contrast with a ConcurrentMap Redis is a remote service and we need to configure serialization to be able to exchange data with it. Keys of the cache entries will be simple String’s, and values will be serialized to JSON.
The only working configuration of the JSON serializer stores a class name of a serialized object in a payload. If we forget (or refuse) to configure it this way, we’ll get the following error.

This approach has one major drawback. If somebody moves or renames the class of the objects cached in Redis, we’re screwed - deserialization will fail, and the method call will result in a runtime exception (the method won’t be called without caching).

If you wonder which serialization Spring Boot uses by default, you’d be surprised that it’s the built-in Java serialization which is slow, inefficient, insecure and has backward compatibility issues.

Hmm… Our JSON serialization is at least better than the default. Anyway, let’s run the test with the switched config. And this is what we get.

What’s Wrong?

Although CompletableFuture is special abstraction for async programming in Java, Spring doesn’t handle it differently. Spring tries to serialize it as a regular Java object and fails. This issue is relevant to caching in all external datastores. It was reported to the Spring Framework team and closed without a fix.

But why and how does the in-memory version work???
Since serialization isn’t required, the instance of CompletableFuture is successfully saved in a ConcurrentMap as any other Java object. Now imagine what happens if the asynchronous computation behind the cached CompletableFuture fails. The failed CompletableFuture is returned from cache until the TTL expires. In practice, the impact of this problem ranges from the intermittent degradation of the application functionality up to severe incidents depending on how close the caching is to the critical path.

If we replace CompletableFuture with a new shiny reactive Mono or Flux types, the situation becomes different. Mono and Flux describe a publisher. The Spring in-memory cache will store these publishers. When we get the publisher from the cache and subscribe to it, the publisher re-runs the action needed to produce the value instead of getting the value from cache. It works as if there was no cache at all! Although it’s useless, it at least doesn’t break anything. However, with a remote cache, the application will still crash.

If you’re still optimistic about Spring caching, keep in mind the following:

  1. Spring abstractions for caching are very primitive and are not suitable for even slightly more complicated use cases such as caching of multiple values inside a single method, caching of async results, dynamic TTLs, custom caching strategies like parallel cache lookup and backward compatibile caching in remote datastores. Moreover, even the basic functionality that Spring provides is hard to test and debug - it’s all hidden behind autogenerated proxies in legacy trenches of 15 years old framework.
  2. Spring AOP. Every class you annotated with @Cacheable must be declared as a bean and caching will work only for public methods called from the outside (compile-time weaving is required for non-public methods).
  3. Spring Expression Language (SpEL). @Cacheable annotation parameters can optionally contain SpEL expressions. SpEL can be used to define complex expressions for keys and caching conditions based on method parameters. Java compiler doesn’t have a clue about SpEL. Each mistake in a SpEL expression will result in a runtime exception.
  4. Spring Test. Testing requires the bootstrapping of Spring application context and Spring-specific test tools that make your test suite unintentionally complicated and slow.

What Can We Do Better?

We can replace @Cacheable with an interface representing required caching capabilities and inject it into our client class.

The cache key is now defined in a type-safe way. The silliest mistakes will be caught by the compiler.
The block of code asynchronously producing a cached value is turned into a Supplier<CompletableFuture<T>>. It’s an expression evaluated only when the cached value is not found.
The cache name is an implementation detail in our case. It’s static and therefore can be configured outside of the client class to avoid situations when the client class uses a cache name that’s not defined in the configuration.

The configuration of CachingHttpClient looks as follows.

And the testing environment doesn’t require Spring. No magic, pure Java.

That’s the API I want to use. It’s simple, gives enough control and prevents from a wide range of mistakes. We don’t need to study tons of Spring documentation to add a predictable caching mechanism to the project. Instead, we combine small composable parts using the fundamental features of a statically typed programming language. Important parts (like serialization mechanism) are explicit and mandatory, invalid states (like not compiling expression for the cache key) are not representable. The code is easily testable and debuggable even by the most junior colleagues who haven’t dived into the enterprise Java world fully yet.

If you want to check the implementation details or play with the code from this blog post, go to the GitHub repo and check out RedisCacheable class that asynchronously caches data in Redis.