Skip to content Skip to sidebar Skip to footer

Printing To The Penultimate Line Of A File

I am wondering if there is a simple way to get to the penultimate line of an open file. f.seek is giving me no end of trouble. I can easily get to the final line, but I can't figur

Solution 1:

Assuming the file isn't too big and memory isn't a concern

open('file.txt').readlines()[-2]

Solution 2:

You can seek from the end of the file and count number of newlines encountered, as soon as you hit the second '\n' stop and call readline():

withopen('foo.txt') as f:
    end_count = 0
    n = -1while end_count != 2:
        f.seek(n, 2)
        if f.read(1) == '\n':
            end_count += 1
        n -= 1printrepr(f.readline())

For a file like:

first line
second line
third line
fourth line
I want this line
last line

The output will be:

'I want this line\n'

Solution 3:

Files are a single long string of bytes on most systems (some have forks, extents or records), leaving the concept of lines to a higher level. Complicating matters further, the line ending doesn't look the same way on all platforms. This means you have to read the lines to identify them, and specifically for text files you can only seek() to places you found using tell().

If we're just reading the penultimate line, it's simple:

alllines=fileobject.readlines()
penultimateline=alllines[-2]

That approach loads the entire file into memory. If we want to replace the end of the file, starting with the penultimate line, things get hairier:

pen,last = 0,0while True:
  last,pen = fileobject.tell(), last
  line = fileobject.readline()
  ifnot line:
    break# back up to the penultimate line
fileobject.seek(pen)    # Note: seek is *required* to switch read/write
fileobject.truncate()
fileobject.write("Ate last two lines, this is a new line.\n")

If you merely want to read lines in an arbitrary order, linecache might be helpful.

Each of these scans the entire file. Tools like tail may make another optimization: read data near the end of the file, until you've found enough newlines to identify the lines you need. This gets more complicated beause the seeking only works predictably in binary mode but the line parsing only works predictably in text mode. That in turn means our guess that the file is separated by linesep could be wrong; Python's universal newline support only operates in text mode.

backsearch=0
lines=[]
while len(lines)<=2:
  backsearch+=200
  fileobject.seek(-backsearch, 2)
  if fileobject.tell()==0:
    break# ran out of file while scanning backwards
  lines=fileobject.read().split(os.linesep)
fileobject.seek(-backsearch, 2)
# Now repeat the earlier method, knowing you're only processing # the final part of the file. 

Solution 4:

defpenultimate(file_path)
    returnopen(file_path).read().splitlines()[len(open(file_path).read().splitlines()) - 2]

Post a Comment for "Printing To The Penultimate Line Of A File"