A really simple and lightweight, incremental parser for JSON streaming (concatenated JSON and line delimited JSON), in PHP. You can use this library to process a stream of data that consists of multiple JSON documents.
A line delimited JSON example stream consisting of 3 individual JSON documents could look like this:
{ "id": 1, "name": "first" }
{ "id": 3, "name": "third" }
{ "id": 6, "name": "sixth" }
The whitespace between the individual JSON documents is optional. Instead of newlines, you can use any number of whitespace or none at all.
A concatenated JSON example stream consisting of 3 individual JSON documents could look like this:
{ "id": 1, "name": "first" }{ "id": 3, "name": "third" }{ "id": 6, "name": "sixth"}
The input stream can be of arbitrary size and can be interrupted at any time. This is often useful for processing network streams, where the chunk/buffer size is not under your control and you could potentially read single bytes only.
Please note that this library is about processing a stream that can contain any number of JSON documents. It is assumed that each document has a reasonable size and fits into memory. This is not to be confused with a streaming parser for processing a single, huge JSON document that is too big to fit into memory.
Note: This project is in beta stage! Feel free to report any issues you encounter.
Once installed, you can use the following sample code to parse a stream of JSON chunks:
$parser = new StreamingJsonParser();
assert($parser->push('[ 1, 2') === array());
assert($parser->push('3 ]') === array(array(1, 2, 3));
assert($parser->push('{} {}') === array(array(), array());
This is actually only a really simple hack to call a normal document based parser whenever it thinks a full document has been found in the input stream. Because the normal parser is implemented as an extension (instead of userland PHP), this turns out to be pretty fast for streams that contain common, rather small objects.
You might want to use this if
- you have to deal with a stream of multiple JSON documents
- you have to handle chunks of incomplete JSON documents
- you prefer a lightweight parser
You probably don't want to use this if
- you deal with complete JSON documents
- you have a proper delimiter (such as newlines) between your individual JSON documents
- your JSON documents are too big to fit into RAM
- you have a CS background and/or are in love with actual incremental, recursive parsers
The recommended way to install this library is through Composer. New to Composer?
{
"require": {
"clue/json-stream": "~0.1.0"
}
}
See also the CHANGELOG for details about version upgrades.
This project aims to run on any platform and thus does not require any PHP extensions and supports running on legacy PHP 5.3 through current PHP 8+ and HHVM. It's highly recommended to use PHP 7+ for this project.
To run the test suite, you first need to clone this repo and then install all dependencies through Composer:
$ composer install
To run the test suite, go to the project root and run:
$ php vendor/bin/phpunit
MIT