如何有效地跳到大型文字檔案中的特定行？

首頁 > 程式設計 > 如何有效地跳到大型文字檔案中的特定行？

如何有效地跳到大型文字檔案中的特定行？

發佈於2024-11-06

How can I efficiently jump to a specific line in a large text file?

優化大型文字檔案中的跳行：另一種方法

處理大量不同長度行的文字檔案時，通常效率很低順序讀取每一行以達到特定的行號。問題中提供的程式碼範例說明了這種方法，需要對整個文件進行可能緩慢的迭代。然而，還有一種替代方法可以透過利用計算出的偏移列表來優化跳線。

基於偏移的跳線

要克服這項挑戰，需要一種更有效的方法涉及讀取檔案一次以建立行偏移清單。每個偏移量都標記特定行的起始位置。透過儲存這些偏移量，您可以直接跳到所需的行，而無需處理前面的行。

這是一個改進的程式碼片段：

# Read the file and build the line offset list
line_offset = []
offset = 0
with open(filename, "rb", 0) as file:
    for line in file:
        line_offset.append(offset)
        offset  = len(line)

# Jump to a specific line (line 141978 in this example)
file.seek(line_offset[141977])  # Adjust the index as lines are zero-indexed

# Process the target line as desired
DoSomethingWithThisLine(line)

利用行偏移列表，可以直接跳到目標行，顯著減少處理時間，提高效率。