Implement an LRU Cache in Go

Front page > Programming > Implement an LRU Cache in Go

Implement an LRU Cache in Go

Published on 2024-08-06

Browse:176

Implement an LRU Cache in Go

So you need a small cache and can't justify a Redis or memcached instance. Let's see what it takes to implement one in Go. For fun, we'll make it using generics so it is reusable in our project.

An LRU cache generally has a fixed capacity and the simplest ejection policy: eject the element that has the longest time since it was accessed. A simple lru cache will implement the following interface:

type LRUCache[T any] interface {
    Get(key string) (value T, found bool)
    Put(key string, value T)
    Keys() []string
    Remove(key string) bool
    Clear()
    Capacity() int
    Len() int
}

We know the cache will store a data item as an entry that is keyed by some value. That sounds like a map. What about implementing the ejection policy? One way to do this is to keep a timeAccessed property along with each item. Something like:

type cacheEntry[T any] struct {
  Data T
  LastAccessed time.time
}

However, let's think about performance, we want to be able to search for the cache key as well as insert and evict the oldest, if necessary, as fast as possible.

Using a map, which is a hashtable, will give us pretty fast performance for lookups. What about finding the oldest entry? If your cache struct looks like:

type LRUCache[T any] {
  capacity int
  keyMap map[string]cacheEntry[T]
}

We will necessarily need to iterate over the map to find the oldest when it comes time to evict an entry.

We need a way to store entries in a way that will efficiently allow us to keep a list of cache entries sorted. It is preferable that we not need to use a sort routine.

A double linked list is a good way to do this and we don't need to store access time in the entry unless we actually want it. So let's suppose we have a linked list that implements the following along with its node struct:

type DoubleLinkedList[T any] interface {
    Head() *DoubleNode[T]
    Tail() *DoubleNode[T]
    // Append inserts new item at tail
    Append(data T) *DoubleNode[T]
    // Push appends new item at head
    Push(data T) *DoubleNode[T]
    Remove(node *DoubleNode[T]) *DoubleNode[T]
    RemoveTail() *DoubleNode[T]
    MoveToHead(node *DoubleNode[T])
}
type DoubleNode[T any] struct {
    Data T
    Prev *DoubleNode[T]
    Next *DoubleNode[T]
}

The cache struct can now look something like:

type lruCache[T any] struct {
    capacity int
    keyMap   map[string]*DoubleNode[lruCacheEntry[T]]
    list     DoubleLinkedList[lruCacheEntry[T]]
}

The cache entry struct will be:

type lruCacheEntry[T any] struct {
    key   string
    value T
}

Realistically, you would probably use an interface for the cache key. I am using a string to keep the code simple.

In the implementation here, the most recently accessed entry in the cache will be at the head and the least recently used will be at the tail. So, when it comes time to evict, we simply remove the tail element of the linked list.

Implementing the Get() function is simple:

func (l *lruCache[T]) Get(key string) (value T, found bool) {
    if node, ok := l.keyMap[key]; ok {
        l.list.MoveToHead(node)
        return node.Data.value, ok
    }
    var zero T
    return zero, false
}

Get just needs to retrieve the map entry for the key, then move the node to the head of the list since it it now the 'most recently used'.

The Put() function is where we will handle the eviction if it's necessary:

func (l *lruCache[T]) Put(key string, value T) {
    if node, ok := l.keyMap[key]; ok {
        node.Data = lruCacheEntry[T]{
            key:   key,
            value: value,
        }
        // move the element to the most recent position
        l.list.MoveToHead(node)
    } else {
        // insert the new element at the head
        newNode := l.list.Push(lruCacheEntry[T]{
            key:   key,
            value: value,
        })
        l.keyMap[key] = newNode
    }
    // is eviction necessary
    if len(l.keyMap) > l.capacity {
        nodeRemoved := l.list.RemoveTail()
        delete(l.keyMap, nodeRemoved.Data.key)
    }
}

For Put(), we first check to see if there is already a value for the given key. If so, then update the value and move the node to the head of the list. Otherwise, we create a new cache entry, add it to the list as the head, and add it to our map.

Finally, don't forget to check for capacity. If the new entry puts us over the capacity, we evict the oldest entry which is the tail of the list and delete the entry from our map.

Note that storing the key as part of the cache entry allows us to rapidly delete the key from the map. If we had only stored the data in the cache entry, then we would need to iterate over the map to find it.

This cache is missing something critical for a multi-threaded app. There is no synchronization. Realistically, a cache would be accessed by multiple threads. Synchronization is a complex topic. For our implementation, we can add a mutex to the cache struct:

type lruCache[T any] struct {
    capacity int
    keyMap   map[string]DoubleNode[lruCacheEntry[T]]
    list     DoubleLinkedList[lruCacheEntry[T]]
    mutex    sync.RWMutex
}

then add the following at the start of each function.

    l.mutex.Lock()
    defer l.mutex.Unlock()

Notice that we are using a read/write lock. Some of the functions don't change the structure of the cache, so we can use the read lock method provided, for example the Len() function:

func (l *lruCache[T]) Len() int {
    l.mutex.RLock()
    defer l.mutex.RUnlock()
    return len(l.keyMap)
}

Note that the synchronization strategy chosen here may break down if there are a large number of threads trying to access the cache. It's a complex topic that could be a series of posts in itself.

See the full implementation in the repository given in the link below.

What would you do different to implement a cache? How would you address synchronization? I am interested in hearing your thoughts on this one. There's no single solution to this so put your comments below.

Thanks!

The code for this post and all posts in this series can be found here

Release Statement This article is reproduced at: https://dev.to/johnscode/implement-an-lru-cache-in-go-1hbc?1 If there is any infringement, please contact [email protected] to delete it

Latest tutorial More>

How to Sort Data by String Length in MySQL Using CHAR_LENGTH()?
Selecting Data by String Length in MySQLTo sort data based on string length in MySQL, instead of using string_length(column), consider using the built...

Programming Posted on 2025-02-07
How Can I UNION Database Tables with Different Numbers of Columns?
Combined tables with different columns] Can encounter challenges when trying to merge database tables with different columns. A straightforward way i...

Programming Posted on 2025-02-07
How Can I Efficiently Count Element Occurrences in a Java List?
Counting Element Occurrences in a ListWithin the realm of Java programming, the task of enumerating element occurrences within a list comes to the for...

Programming Posted on 2025-02-07
How to Send Files to Users from a PHP Server?
Send File to the UserWhen a user interacts with a PHP script, you may encounter scenarios where you need to transmit a file, such as a PDF, to the cli...

Programming Posted on 2025-02-07
Microbenchmarking: Is it a reliable tool for evaluating system performance?
Microbenchmarking: What It Is and What It Isn'tMicrobenchmarking is a practice that involves measuring the performance of a specific isolated piec...

Programming Posted on 2025-02-07
How Can I Efficiently Join Tables in SQL to Retrieve Data from Multiple Columns?
Retrieving Data from Multiple Columns Using SQL Table Joins This guide demonstrates how to efficiently combine data from multiple SQL tables to retri...

Programming Posted on 2025-02-07
Why Does Microsoft Visual C++ Fail to Correctly Implement Two-Phase Template Instantiation?
The Mystery of "Broken" Two-Phase Template Instantiation in Microsoft Visual C Problem Statement:Users commonly express concerns that Micro...

Programming Posted on 2025-02-07
Does `exec()` Update Local Variables in Python 3, and If Not, How Can It Be Made To?
exec's Impact on Local Variables: A Dive InThe exec function, a Python staple for dynamic code execution, poses an intriguing query: can it update...

Programming Posted on 2025-02-07
How to Ensure Hibernate Preserves Enum Values When Mapping to a MySQL Enum Column?
Preserving Enum Values in Hibernate: Troubleshooting Wrong Column TypeIn the realm of data persistence, ensuring the compatibility between data models...

Programming Posted on 2025-02-07
Why Does My C++ Code Produce "Error LNK2019: Unresolved External Symbol _WinMain@16"?
Resolving "Error LNK2019: Unresolved External Symbol _WinMain@16 Referenced in ___tmainCRTStartup"While attempting to execute a basic code s...

Programming Posted on 2025-02-07
$How to Implement Custom Exception Handling with Python\'s Logging Module?$
How to Implement Custom Exception Handling with Python\'s Logging Module?
Custom Error Handling with Python's Logging ModuleEnsuring that uncaught exceptions are properly handled and logged can be crucial for troubleshoo...

Programming Posted on 2025-02-07
How to Check if an Object Has a Specific Attribute in Python?
Method to Determine Object Attribute ExistenceThis inquiry seeks a method to verify the presence of a specific attribute within an object. Consider th...

Programming Posted on 2025-02-07
How Can I Add Items to an IEnumerable?
Adding Items to an IEnumerableMany developers seek a method like items.Add(item) for adding elements to an IEnumerable collection. However, this canno...

Programming Posted on 2025-02-07
How Can I Reliably Check for Column Existence in a MySQL Table?
Determining Column Existence in a MySQL TableIn MySQL, verifying the presence of a column in a table can be a bit perplexing compared to other databas...

Programming Posted on 2025-02-07
How Can I Create Empty Indented Blocks in Python Without Errors?
Avoiding IndentationError When Writing Empty Indented Blocks in PythonWhen writing Python code, it's not uncommon to encounter an "expected a...

Programming Posted on 2025-02-07