README.md
1JSMN
2jsmn (pronounced like 'jasmine') is a minimalistic JSON parser in C. It can be easily integrated into resource-limited or embedded projects.
3You can find more information about JSON format at json.org
4Library sources are available at bitbucket.org/zserge/jsmn
5The web page with some information about jsmn can be found at http://zserge.com/jsmn.html
6Philosophy
7Most JSON parsers offer you a bunch of functions to load JSON data, parse it and extract any value by its name. jsmn proves that checking the correctness of every JSON packet or allocating temporary objects to store parsed JSON fields often is an overkill.
8JSON format itself is extremely simple, so why should we complicate it?
9jsmn is designed to be robust (it should work fine even with erroneous data), fast (it should parse data on the fly), portable (no superfluous dependencies or non-standard C extensions). An of course, simplicity is a key feature - simple code style, simple algorithm, simple integration into other projects.
10Features
11compatible with C89
12no dependencies (even libc!)
13highly portable (tested on x86/amd64, ARM, AVR)
14about 200 lines of code
15extremely small code footprint
16API contains only 2 functions
17no dynamic memory allocation
18incremental single-pass parsing
19library code is covered with unit-tests
20Design
21The rudimentary jsmn object is a token. Let's consider a JSON string:
22'{ "name" : "Jack", "age" : 27 }'
23It holds the following tokens:
24Object: { "name" : "Jack", "age" : 27} (the whole object)
25Strings: "name", "Jack", "age" (keys and some values)
26Number: 27
27In jsmn, tokens do not hold any data, but point to token boundaries in JSON string instead. In the example above jsmn will create tokens like: Object [0..31], String [3..7], String [12..16], String [20..23], Number [27..29].
28Every jsmn token has a type, which indicates the type of corresponding JSON token. jsmn supports the following token types:
29Object - a container of key-value pairs, e.g.: { "foo":"bar", "x":0.3 }
30Array - a sequence of values, e.g.: [ 1, 2, 3 ]
31String - a quoted sequence of chars, e.g.: "foo"
32Primitive - a number, a boolean (true, false) or null
33Besides start/end positions, jsmn tokens for complex types (like arrays or objects) also contain a number of child items, so you can easily follow object hierarchy.
34This approach provides enough information for parsing any JSON data and makes it possible to use zero-copy techniques.
35Install
36To clone the repository you should have mercurial installed. Just run:
37$ hg clone http://bitbucket.org/zserge/jsmn jsmn
38Repository layout is simple: jsmn.c and jsmn.h are library files, tests are in the jsmn_test.c, you will also find README, LICENSE and Makefile files inside.
39To build the library, run make. It is also recommended to run make test. Let me know, if some tests fail.
40If build was successful, you should get a libjsmn.a library. The header file you should include is called "jsmn.h".
41API
42Token types are described by jsmntype_t:
43typedef enum {
44 JSMN_PRIMITIVE = 0,
45 JSMN_OBJECT = 1,
46 JSMN_ARRAY = 2,
47 JSMN_STRING = 3
48} jsmntype_t;
49Note: Unlike JSON data types, primitive tokens are not divided into numbers, booleans and null, because one can easily tell the type using the first character:
50<code>'t', 'f'</code> - boolean
51<code>'n'</code> - null
52<code>'-', '0'..'9'</code> - number
53Token is an object of jsmntok_t type:
54typedef struct {
55 jsmntype_t type; // Token type
56 int start; // Token start position
57 int end; // Token end position
58 int size; // Number of child (nested) tokens
59} jsmntok_t;
60Note: string tokens point to the first character after the opening quote and the previous symbol before final quote. This was made to simplify string extraction from JSON data.
61All job is done by jsmn_parser object. You can initialize a new parser using:
62struct jsmn_parser parser;
63jsmntok_t tokens[10];
64
65// js - pointer to JSON string
66// tokens - an array of tokens available
67// 10 - number of tokens available
68jsmn_init_parser(&parser, js, tokens, 10);
69This will create a parser, that can parse up to 10 JSON tokens from js string.
70Later, you can use jsmn_parse(&parser) function to process JSON string with the parser.
71A non-negative value is the number of tokens actually used by the parser. Passing NULL instead of the tokens array would not store parsing results, but instead the function will return the value of tokens needed to parse the given string. This can be useful if you don't know yet how many tokens to allocate.
72If something goes wrong, you will get an error. Error will be one of these:
73JSMN_ERROR_INVAL - bad token, JSON string is corrupted
74JSMN_ERROR_NOMEM - not enough tokens, JSON string is too large
75JSMN_ERROR_PART - JSON string is too short, expecting more JSON data
76If you get JSON_ERROR_NOMEM, you can re-allocate more tokens and call jsmn_parse once more. If you read json data from the stream, you can periodically call jsmn_parse and check if return value is JSON_ERROR_PART. You will get this error until you reach the end of JSON data.
77Other info
78This software is distributed under MIT license, so feel free to integrate it in your commercial products.
79