sqlparse
Using parsestream blocks until fully read
I was trying to combine sqlparse.parsestream with reading from stdin, but could not get it to work properly. Here's my approach:
stream.py:
import sys
import os
import sqlparse
#newin = os.fdopen(sys.stdin.fileno(), 'r', 1)
s = sqlparse.parsestream(sys.stdin)
for i in s:
    print i
Calling it with:
for i in `seq 1 5`; do echo select $i\;; sleep 1; done | python stream.py
The output only becomes available after 5 seconds. I was expecting parsestream to yield each statement as soon as it is read from stdin. I also tried creating a separate newin that uses a buffer size of 1.
Is there anything I'm doing wrong here?
I'm on OS X Yosemite with Python 2.7.9.
That's an interesting use case! At the moment parsestream relies on EOF and is intended for patterns like "echo 'select * from foo' | stream.py". Are you trying to format log files in a tail-like manner?
For now parsestream doesn't try to determine the end of a statement at all (although that might be possible). That would be required for this use case.
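To illustrate the kind of incremental statement detection being discussed, here is a rough sketch in modern Python (the thread uses Python 2, but the idea is the same). `iter_statements` is a hypothetical helper, not part of sqlparse, and it naively treats every `;` as a statement terminator, so semicolons inside string literals or comments would be mis-split; a real implementation would need a tokenizer to find statement boundaries correctly.

```python
import io

def iter_statements(stream):
    """Yield one SQL statement at a time from a text stream.

    Hypothetical helper, not part of sqlparse: every ';' is naively
    treated as a statement terminator, and each statement is yielded
    as soon as its terminator is read, without waiting for EOF.
    """
    buf = ''
    for line in stream:
        buf += line
        while ';' in buf:
            stmt, _, buf = buf.partition(';')
            if stmt.strip():
                yield stmt.strip() + ';'
    if buf.strip():  # trailing statement without a terminator
        yield buf.strip()

stmts = list(iter_statements(io.StringIO("select 1;\nselect 2;\n")))
```

Each yielded string could then be handed to sqlparse.parse individually, keeping only one statement in memory at a time.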
Thanks for your reply! I have a large file with lots of SQL statements (70K+ lines) that I want to start executing as soon as the first statement has been read successfully. At the same time I want to be able to generate SQL statements with a script and pipe them to my stream.py.
I already use sqlparse to read SQL statements from a file/stdin and execute them (see https://github.com/resamsel/dbnavigator/blob/master/src/dbnav/executer/init.py if you're interested in what I'm currently working on). I'm trying to replace the read_statements function with a stream to work around the long time it takes to read and parse all of those 70K+ lines...
+1 I'm interested in reading a database dump (.sql) and being able to extract information from Python without needing to actually stage it in a real database. I'm working with files of over 100k lines. The performance is okay so far, but as I scale up, it would be nice if the generator could start returning results without needing to read the entire file into memory.
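As a stopgap for that kind of lazy extraction, one can scan a dump line by line without fully parsing it. The sketch below uses a hypothetical helper, `count_inserts` (not part of sqlparse), that tallies INSERT statements per table while reading the stream lazily, so the whole file never has to sit in memory; it assumes one statement per line, as is typical in dumps.

```python
import io
import re
from collections import Counter

# Matches the table name in lines like: INSERT INTO `users` VALUES (...)
INSERT_RE = re.compile(r'^\s*INSERT\s+INTO\s+[`"]?(\w+)', re.IGNORECASE)

def count_inserts(stream):
    """Hypothetical helper: count INSERT statements per table,
    reading the dump one line at a time."""
    counts = Counter()
    for line in stream:
        m = INSERT_RE.match(line)
        if m:
            counts[m.group(1)] += 1
    return counts

dump = io.StringIO(
    "INSERT INTO `users` VALUES (1);\n"
    "INSERT INTO `users` VALUES (2);\n"
    "INSERT INTO `orders` VALUES (1);\n"
)
counts = count_inserts(dump)
```

For anything beyond simple tallies, the extracted lines could still be handed to sqlparse.parse one statement at a time.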
+1 I would like to use this library for parsing large SQL dump files.
Will anyone be able to provide an example and test for this? I think I see what's going on, but would like to validate it.
@vmuriart, what's wrong with the example and test in the original post? I'd be happy to provide more information, if you need any.
I'm on a Windows computer :sob:. Specifically though, I was looking for something that could be added to the library's tests.
Here are some free SQL dumps found via Google. Some are quite large.
http://dev.mysql.com/doc/index-other.html
http://sportsdb.org/sd/samples
More: https://www.google.com/search?safe=off&q=filetype%3Asql
Thanks @mehaase. I meant an example that can be included in the tests module. I'm having a hard time coming up with a way to automate this test.
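One way such a test could be automated, sketched here with a hypothetical stand-in splitter rather than sqlparse itself: wrap the input stream so it counts the lines read, then assert that the first statement is yielded before the whole stream has been consumed. Any streaming implementation should pass this, and the current EOF-based one would fail it.

```python
import io

class CountingStream:
    """Wrap a text stream and record how many lines have been read."""
    def __init__(self, stream):
        self.stream = stream
        self.lines_read = 0
    def __iter__(self):
        for line in self.stream:
            self.lines_read += 1
            yield line

def naive_statements(stream):
    # Stand-in for a streaming parser: naively treat ';' as terminator.
    buf = ''
    for line in stream:
        buf += line
        while ';' in buf:
            stmt, _, buf = buf.partition(';')
            if stmt.strip():
                yield stmt.strip() + ';'

# Synthetic dump of 1000 statements; a streaming implementation should
# yield the first statement after reading just one input line.
dump = CountingStream(io.StringIO(''.join('select %d;\n' % i for i in range(1000))))
first = next(naive_statements(dump))
```

Generating the dump in the test itself avoids having to ship a large fixture file with the test suite.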
Sorry for reviving an old issue, but it would be great if something similar (exactly the same?) to what @Toilal has added in the fork were added upstream!