Iterate a large .xz file line by line in Python

I ran into the same question a few weeks ago. This snippet worked for me:

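import lzma

# Text mode ('rt') decompresses lazily and yields one decoded line at a time.
with lzma.open('filename.xz', mode='rt') as file:
    for line in file:
        # Each line keeps its trailing newline, so tell print() not to add another.
        print(line, end='')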

This assumes that the text in the compressed file is encoded in the platform's default encoding, which is UTF-8 on most systems (and was the case for my data). lzma.open() accepts an encoding argument in text mode, so you can set another encoding if needed.

EDIT (after your own edit): try forcing encoding='utf-8' in lzma.open().
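
For illustration, here is a minimal sketch of the same loop with the encoding passed explicitly. The helper name iter_xz_lines and the sample file are hypothetical and exist only for this demo; the only library call relied on is lzma.open() with its mode, encoding and errors arguments.

```python
import lzma
import os
import tempfile


def iter_xz_lines(path, encoding="utf-8", errors="strict"):
    """Yield decoded lines from an .xz file one at a time (hypothetical helper)."""
    # Text mode streams the decompressed data, so the whole file is never held in memory.
    with lzma.open(path, mode="rt", encoding=encoding, errors=errors) as fh:
        for line in fh:
            # Drop the trailing newline that each line carries.
            yield line.rstrip("\n")


# Demo: write a tiny .xz file to a temporary directory, then read it back line by line.
tmp_dir = tempfile.mkdtemp()
sample_path = os.path.join(tmp_dir, "sample.xz")
with lzma.open(sample_path, mode="wt", encoding="utf-8") as fh:
    fh.write("first line\nsecond line: café\n")

lines = list(iter_xz_lines(sample_path))
for line in lines:
    print(line)
```

If the file contains bytes that are not valid in the chosen encoding, passing errors='replace' lets the loop continue instead of raising UnicodeDecodeError, at the cost of substituting the offending characters.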
