Usando compiladores JIT para tornar meus loops Python mais lentos?

Primeira página > Programação > Usando compiladores JIT para tornar meus loops Python mais lentos?

Usando compiladores JIT para tornar meus loops Python mais lentos?

Publicado em 01/09/2024

Navegar:179

If you haven't heard, Python loops can be slow--especially when working with large datasets. If you're trying to make calculations across millions of data points, execution time can quickly become a bottleneck. Luckily for us, Numba has a Just-in-Time (JIT) compiler that we can use to help speed up our numerical computations and loops in Python.

The other day, I found myself in need of a simple exponential smoothing function in Python. This function needed to take in array and return an array of the same length with the smoothed values. Typically, I try and avoid loops where possible in Python (especially when dealing with Pandas DataFrames). At my current level of capability, I didn't see how to avoid using a loop to exponentially smooth an array of values.

I am going to walk through the process of creating this exponential smoothing function and testing it with and without the JIT compilation. I'll briefly touch on JIT and how I made sure to code the the loop in a manner that worked with the nopython mode.

What is JIT?

JIT compilers are particularly useful with higher-level languages like Python, JavaScript, and Java. These languages are known for their flexibility and ease of use, but they can suffer from slower execution speeds compared to lower-level languages like C or C . JIT compilation helps bridge this gap by optimizing the execution of code at runtime, making it faster without sacrificing the advantages of these higher-level languages.

When using the nopython=True mode in the Numba JIT compiler, the Python interpreter is bypassed entirely, forcing Numba to compile everything down to machine code. This results in even faster execution by eliminating the overhead associated with Python's dynamic typing and other interpreter-related operations.

Building the fast exponential smoothing function

Exponential smoothing is a technique used to smooth out data by applying a weighted average over past observations. The formula for exponential smoothing is:

S_{t} = α \cdot V_{t} (1 - α) \cdot S_{t - 1} S_t = \alpha \cdot V_t   (1 - \alpha) \cdot S_{t-1}

where:

$S_{t} S_t$ : Represents the smoothed value at time $t t$ .
$V_{t} V_t$ : Represents the original value at time $t t$ from the values array.
$α \alpha$ : The smoothing factor, which determines the weight of the current value $V_{t} V_t$ in the smoothing process.
$S_{t - 1} S_{t-1}$ : Represents the smoothed value at time $t - 1 t-1$ , i.e., the previous smoothed value.

The formula applies exponential smoothing, where:

The new smoothed value $S_{t} S_t$ is a weighted average of the current value $V_{t} V_t$ and the previous smoothed value $S_{t - 1} S_{t-1}$ .
The factor $α \alpha$ determines how much influence the current value $V_{t} V_t$ has on the smoothed value compared to the previous smoothed value $S_{t - 1} S_{t-1}$ .

To implement this in Python, and stick to functionality that works with nopython=True mode, we will pass in an array of data values and the alpha float. I default the alpha to 0.33333333 because that fits my current use case. We will initialize an empty array to store the smoothed values in, loop and calculate, and return smoothed values. This is what it looks like:

@jit(nopython=True) 
def fast_exponential_smoothing(values, alpha=0.33333333): 

    smoothed_values = np.zeros_like(values) # Array of zeros the same length as values
    smoothed_values[0] = values[0] # Initialize the first value 

    for i in range(1, len(values)): 
        smoothed_values[i] = alpha * values[i]   (1 - alpha) * smoothed_values[i - 1]
    return smoothed_values

Simple, right? Let's see if JIT is doing anything now. First, we need to create a large array of integers. Then, we call the function, time how long it took to compute, and print the results.

# Generate a large random array of a million integers
large_array = np.random.randint(1, 100, size=1_000_000)

# Test the speed of fast_exponential_smoothing
start_time = time.time()
smoothed_result = fast_exponential_smoothing(large_array)
end_time = time.time()
print(f"Exponential Smoothing with JIT took {end_time - start_time:.6f} seconds with 1,000,000 sample array.")

This can be repeated and altered just a bit to test the function without the JIT decorator. Here are the results that I got:

Using JIT-compilers to make my Python loops slower?

Wait, what the f***?

I thought JIT was supposed to speed it up. It looks like the standard Python function beat the JIT version and a version that attempts to use no recursion. That's strange. I guess you can't just slap the JIT decorator on something and make it go faster? Perhaps simple array loops and NumPy operations are already pretty efficient? Perhaps I don't understand the use case for JIT as well as I should? Maybe we should try this on a more complex loop?

Here is the entire code python file I created for testing:

import numpy as np
from numba import jit
import time

@jit(nopython=True) 
def fast_exponential_smoothing(values, alpha=0.33333333): 

    smoothed_values = np.zeros_like(values) # Array of zeros the same length as values
    smoothed_values[0] = values[0] # Initialize the first value 

    for i in range(1, len(values)): 
        smoothed_values[i] = alpha * values[i]   (1 - alpha) * smoothed_values[i - 1]
        return smoothed_values

def fast_exponential_smoothing_nojit(values, alpha=0.33333333):

    smoothed_values = np.zeros_like(values) # Array of zeros the same length as values
    smoothed_values[0] = values[0] # Initialize the first value 

    for i in range(1, len(values)): 
        smoothed_values[i] = alpha * values[i]   (1 - alpha) * smoothed_values[i - 1]
        return smoothed_values

def non_recursive_exponential_smoothing(values, alpha=0.33333333):
    n = len(values)
    smoothed_values = np.zeros(n)

    # Initialize the first value
    smoothed_values[0] = values[0]

    # Calculate the rest of the smoothed values
    decay_factors = (1 - alpha) ** np.arange(1, n)
    cumulative_weights = alpha * decay_factors
    smoothed_values[1:] = np.cumsum(values[1:] * np.flip(cumulative_weights))   (1 - alpha) ** np.arange(1, n) * values[0]

    return smoothed_values

# Generate a large random array of a million integers
large_array = np.random.randint(1, 1000, size=10_000_000)

# Test the speed of fast_exponential_smoothing
start_time = time.time()
smoothed_result = fast_exponential_smoothing_nojit(large_array)
end_time = time.time()
print(f"Exponential Smoothing without JIT took {end_time - start_time:.6f} seconds with 1,000,000 sample array.")

# Test the speed of fast_exponential_smoothing
start_time = time.time()
smoothed_result = fast_exponential_smoothing(large_array)
end_time = time.time()
print(f"Exponential Smoothing with JIT took {end_time - start_time:.6f} seconds with 1,000,000 sample array.")

# Test the speed of fast_exponential_smoothing
start_time = time.time()
smoothed_result = non_recursive_exponential_smoothing(large_array)
end_time = time.time()
print(f"Exponential Smoothing with no recursion or JIT took {end_time - start_time:.6f} seconds with 1,000,000 sample array.")

I attempted to create the non-recursive version to see if vectorized operations across arrays would make it go faster, but it seems to be pretty damn fast as it is. These results remained the same all the way up until I didn't have enough memory to make the array of random integers.

Let me know what you think about this in the comments. I am by no means a professional developer, so I am accepting all comments, criticisms, or educational opportunities.

Until next time.

Happy coding!

Declaração de lançamento Este artigo foi reproduzido em: https://dev.to/kanndide/using-jit-compilers-to-make-my-python-loops-slower-4m66?1 Se houver alguma violação, entre em contato com [email protected] para excluí-lo

Tutorial mais recente Mais>

Explorando os novos recursos do Java 23
Caros desenvolvedores, entusiastas de programação e alunos, Java Development Kit (JDK) 23 foi lançado oficialmente (2024/09/17 General Availability) m...

Programação Publicado em 2024-11-06
Desestruturação de array ES6: Por que não funciona conforme o esperado?
Desestruturação de array ES6: comportamento imprevistoNo ES6, a desestruturação de arrays pode levar a resultados inesperados, deixando os programador...

Programação Publicado em 2024-11-06
Como posso redimensionar uma imagem para caber na janela do navegador sem distorção?
Redimensionar uma imagem para caber na janela do navegador sem distorçãoRedimensionar uma imagem para caber na janela do navegador é uma tarefa comum ...

Programação Publicado em 2024-11-06
Orientação a Objetos - Métodos em Java
Na programação orientada a objetos em Java, os métodos desempenham um papel crucial na definição do comportamento das classes e objetos. Eles permitem...

Programação Publicado em 2024-11-06
Como corrigir o erro “Nenhum arquivo ou diretório” nas migrações do Laravel em um Mac usando MAMP?
Resolvendo o erro "Nenhum arquivo ou diretório" nas migrações do Laravel em um MacIntrodução: Ao tentar executar o comando “php crafts migra...

Programação Publicado em 2024-11-06
Princípios SOLID usando algumas analogias divertidas com Exemplo de Veículo
SOLID é um acrônimo para um grupo de cinco bons princípios (regras) em programação de computadores. SOLID permite que os programadores escre...

Programação Publicado em 2024-11-06
Como retornar um valor resolvido de uma função assíncrona dentro de outra função assíncrona?
Como retornar um valor de uma função assíncrona?No código fornecido, o método init() retorna uma promessa, mas o O método getPostById() está tentando ...

Programação Publicado em 2024-11-06
Aprenda como construir um jogo de xadrez multijogador com React
Hello and welcome! ?? Today I bring a tutorial to guide you through building a multiplayer chess game using SuperViz. Multiplayer games require real-t...

Programação Publicado em 2024-11-06
Como validar datas no formato DD/MM/AAAA usando expressão regular JavaScript?
Validando datas no formato DD/MM/AAAA usando expressões regulares JavaScriptValidar datas é uma tarefa comum na programação e a capacidade de garantir...

Programação Publicado em 2024-11-06
Limitação e redução em JavaScript: um guia para iniciantes
Ao usar JavaScript, o excesso de gatilhos de eventos pode tornar seu aplicativo mais lento. Por exemplo, um usuário que redimensiona a janela do naveg...

Programação Publicado em 2024-11-06
Como solucionar um erro 403 proibido ao importar um repositório Bitbucket privado no Go?
Solucionar problemas de importação de Go de um repositório privado do Bitbucket (403 Proibido)Importar um repositório privado do Bitbucket.org usando ...

Programação Publicado em 2024-11-06
Escopos Singleton e Prototype Spring Bean: uma exploração detalhada
Quando comecei a trabalhar com Spring, um dos conceitos que mais me intrigou foi a ideia de bean scopes. Spring fornece vários escopos de bean que det...

Programação Publicado em 2024-11-06
Como suavizar efetivamente curvas de dados barulhentas?
Suavização ideal de curvas ruidosasConsidere um conjunto de dados aproximado por:import numpy as np x = np.linspace(0, 2*np.pi, 100) y = np.sin(x) n...

Programação Publicado em 2024-11-06
Como renumerar um índice primário para valores sequenciais ordenados no MySQL?
Renumerando o índice primário para valores sequenciais ordenadosSe o índice primário (id) da sua tabela MySQL aparecer em uma ordem inconsistente (por...

Programação Publicado em 2024-11-06
Literais de objeto aprimorados
ES6 introduziu 3 maneiras de escrever literais de objetos Primeira maneira: - ES6 Enhanced object literal syntax can take an external object like sal...

Programação Publicado em 2024-11-06

Classificação Mais>

Aprenda japonês Aprender coreano Aprenda chinês Aprender língua estrangeira Jogo Problema comum Periféricos de tecnologia IA Tutorial de software Programação Artigo