cpp-peglib/README.md

cpp-peglib
==========

C++11 header-only [PEG](http://en.wikipedia.org/wiki/Parsing_expression_grammar) (Parsing Expression Grammars) library.

*cpp-peglib* tries to provide more expressive parsing experience in a simple way. This library depends on only one header file. So, you can start using it right away just by including `peglib.h` in your project.

The PEG syntax is well described on page 2 in the [document](http://pdos.csail.mit.edu/papers/parsing:popl04.pdf). *cpp-peglib* also supports the following additional syntax for now:

  * `<` ... `>` (Token boundary operator)
  * `$<` ... `>` (Capture operator)
  * `$name<` ... `>` (Named capture operator)
  * `~` (Ignore operator)
  * `\x20` (Hex number char)

This library also supports the linear-time parsing known as the [*Packrat*](http://pdos.csail.mit.edu/~baford/packrat/thesis/thesis.pdf) parsing.

How to use
----------

This is a simple calculator sample. It shows how to define grammar, associate samantic actions to the grammar and handle semantic values.

```c++
// (1) Include the header file
#include <peglib.h>
#include <assert.h>

using namespace peg;
using namespace std;

int main(void) {
    // (2) Make a parser
    auto syntax = R"(
        # Grammar for Calculator...
        Additive  <- Multitive '+' Additive / Multitive
        Multitive <- Primary '*' Multitive / Primary
        Primary   <- '(' Additive ')' / Number
        Number    <- [0-9]+
    )";

    parser parser(syntax);

    // (3) Setup an action
    parser["Additive"] = [](const SemanticValues& sv) {
        switch (sv.choice) {
        case 0:  // "Multitive '+' Additive"
            return sv[0].get<int>() + sv[1].get<int>();
        default: // "Multitive"
            return sv[0].get<int>();
        }
    };

    parser["Multitive"] = [](const SemanticValues& sv) {
        switch (sv.choice) {
        case 0:  // "Primary '*' Multitive"
            return sv[0].get<int>() * sv[1].get<int>();
        default: // "Primary"
            return sv[0].get<int>();
        }
    };

    parser["Number"] = [](const SemanticValues& sv) {
        return stoi(sv.str(), nullptr, 10);
    };

    // (4) Parse
    parser.packrat_parsing(); // Enable packrat parsing.

    int val;
    parser.parse("(1+2)*3", val);

    assert(val == 9);
}
```

Here are available actions:

```c++
[](const SemanticValues& sv, any& dt)
[](const SemanticValues& sv)
```

`const SemanticValues& sv` contains semantic values. `SemanticValues` structure is defined as follows.

```c++
struct SemanticValue {
    any         val;  // Semantic value
    const char* name; // Definition name for the sematic value
    const char* s;    // Token start for the semantic value
    size_t      n;    // Token length for the semantic value

    // Cast semantic value
    template <typename T> T& get();
    template <typename T> const T& get() const;

    // Get token
    std::string str() const;
};

struct SemanticValues : protected std::vector<SemanticValue>
{
    const char* s;      // Token start
    size_t      n;      // Token length
    size_t      choice; // Choice number (0 based index)

    // Get token
    std::string str() const;

    // Transform the semantice value vector to another vector
    template <typename T> vector<T> transform(size_t beg = 0, size_t end = -1) const;
}
```

`peg::any` class is very similar to [boost::any](http://www.boost.org/doc/libs/1_57_0/doc/html/any.html). You can obtain a value by castning it to the actual type. In order to determine the actual type, you have to check the return value type of the child action for the semantic value.

`const char* s, size_t n` gives a pointer and length of the matched string. This is same as `sv.s` and `sv.n`.

`any& dt` is a data object which can be used by the user for whatever purposes.

The following example uses `<` ... ` >` operators. They are the *token boundary* operators. Each token boundary operator creates a semantic value that contains `const char*` of the position. It could be useful to eliminate unnecessary characters.

```c++
auto syntax = R"(
    ROOT  <- _ TOKEN (',' _ TOKEN)*
    TOKEN <- < [a-z0-9]+ > _
    _     <- [ \t\r\n]*
)";

peg pg(syntax);

pg["TOKEN"] = [](const SemanticValues& sv) {
    // 'token' doesn't include trailing whitespaces
    auto token = sv.str();
};

auto ret = pg.parse(" token1, token2 ");
```

We can ignore unnecessary semantic values from the list by using `~` operator.

```c++
peg::pegparser parser(
    "  ROOT  <-  _ ITEM (',' _ ITEM _)*  "
    "  ITEM  <-  ([a-z])+                "
    "  ~_    <-  [ \t]*                  "
);

parser["ROOT"] = [&](const SemanticValues& sv) {
    assert(sv.size() == 2); // should be 2 instead of 5.
};

auto ret = parser.parse(" item1, item2 ");
```

The following grammar is same as the above.

```c++
peg::parser parser(
    "  ROOT  <-  ~_ ITEM (',' ~_ ITEM ~_)*  "
    "  ITEM  <-  ([a-z])+                   "
    "  _     <-  [ \t]*                     "
);
```

*Semantic predicate* support is available. We can do it by throwing a `peg::parse_error` exception in a semantic action.

```c++
peg::parser parser("NUMBER  <-  [0-9]+");

parser["NUMBER"] = [](const SemanticValues& sv) {
    auto val = stol(sv.str(), nullptr, 10);
    if (val != 100) {
        throw peg::parse_error("value error!!");
    }
    return val;
};

long val;
auto ret = parser.parse("100", val);
assert(ret == true);
assert(val == 100);

ret = parser.parse("200", val);
assert(ret == false);
```

Simple interface
----------------

*cpp-peglib* provides std::regex-like simple interface for trivial tasks.

`peg::peg_match` tries to capture strings in the `$< ... >` operator and store them into `peg::match` object.

```c++
peg::match m;

auto ret = peg::peg_match(
    R"(
        ROOT      <-  _ ('[' $< TAG_NAME > ']' _)*
        TAG_NAME  <-  (!']' .)+
        _         <-  [ \t]*
    )",
    " [tag1] [tag:2] [tag-3] ",
    m);

assert(ret == true);
assert(m.size() == 4);
assert(m.str(1) == "tag1");
assert(m.str(2) == "tag:2");
assert(m.str(3) == "tag-3");
```

It also supports named capture with the `$name<` ... `>` operator.

```c++
peg::match m;

auto ret = peg::peg_match(
    R"(
        ROOT      <-  _ ('[' $test< TAG_NAME > ']' _)*
        TAG_NAME  <-  (!']' .)+
        _         <-  [ \t]*
    )",
    " [tag1] [tag:2] [tag-3] ",
    m);

auto cap = m.named_capture("test");

REQUIRE(ret == true);
REQUIRE(m.size() == 4);
REQUIRE(cap.size() == 3);
REQUIRE(m.str(cap[2]) == "tag-3");
```

There are some ways to *search* a peg pattern in a document.

```c++
using namespace peg;

auto syntax = R"(
    ROOT <- '[' $< [a-z0-9]+ > ']'
)";

auto s = " [tag1] [tag2] [tag3] ";

// peg::peg_search
parser pg(syntax);
size_t pos = 0;
auto n = strlen(s);
match m;
while (peg_search(pg, s + pos, n - pos, m)) {
    cout << m.str()  << endl; // entire match
    cout << m.str(1) << endl; // submatch #1
    pos += m.length();
}

// peg::peg_token_iterator
peg_token_iterator it(syntax, s);
while (it != peg_token_iterator()) {
    cout << it->str()  << endl; // entire match
    cout << it->str(1) << endl; // submatch #1
    ++it;
}

// peg::peg_token_range
for (auto& m: peg_token_range(syntax, s)) {
    cout << m.str()  << endl; // entire match
    cout << m.str(1) << endl; // submatch #1
}
```

Make a parser with parser operators
-----------------------------------

Instead of makeing a parser by parsing PEG syntax text, we can also construct a parser by hand with *parser operators*. Here is an example:

```c++
using namespace peg;
using namespace std;

vector<string> tags;

Definition ROOT, TAG_NAME, _;
ROOT     <= seq(_, zom(seq(chr('['), TAG_NAME, chr(']'), _)));
TAG_NAME <= oom(seq(npd(chr(']')), dot())), [&](const SemanticValues& sv) {
                tags.push_back(sv.str());
            };
_        <= zom(cls(" \t"));

auto ret = ROOT.parse(" [tag1] [tag:2] [tag-3] ");
```

The following are available operators:

| Operator |     Description       |
| :------- | :-------------------- |
| seq      | Sequence              |
| cho      | Prioritized Choice    |
| zom      | Zero or More          |
| oom      | One or More           |
| opt      | Optional              |
| apd      | And predicate         |
| npd      | Not predicate         |
| lit      | Literal string        |
| cls      | Character class       |
| chr      | Character             |
| dot      | Any character         |
| tok      | Token boundary        |
| ign      | Ignore semantic value |
| cap      | Capture character     |
| usr      | User defiend parser   |

Adjust definitions
------------------

It's possible to add/override definitions.

```c++
auto syntax = R"(
    ROOT <- _ 'Hello' _ NAME '!' _
)";

Rules additional_rules = {
    {
        "NAME", usr([](const char* s, size_t n, SemanticValues& sv, any& c) -> size_t {
            static vector<string> names = { "PEG", "BNF" };
            for (const auto& name: names) {
                if (name.size() <= n && !name.compare(0, name.size(), s, name.size())) {
                    return name.size(); // processed length
                }
            }
            return -1; // parse error
        })
    },
    {
        "~_", zom(cls(" \t\r\n"))
    }
};

auto g = parser(syntax, additional_rules);

assert(g.parse(" Hello BNF! "));
```

Sample codes
------------

  * [Calculator](https://github.com/yhirose/cpp-peglib/blob/master/example/calc.cc)
  * [Calculator (with parser operators)](https://github.com/yhirose/cpp-peglib/blob/master/example/calc2.cc)
  * [Calculator (AST version)](https://github.com/yhirose/cpp-peglib/blob/master/example/calc3.cc)
  * [PEG syntax Lint utility](https://github.com/yhirose/cpp-peglib/blob/master/lint/cmdline/peglint.cc)
  * [PL/0 Interpreter](https://github.com/yhirose/cpp-peglib/blob/master/language/pl0/pl0.cc)

Tested Compilers
----------------

  * Visual Studio 2015
  * Clang 3.5

TODO
----

  * Unicode support

License
-------

MIT license (© 2015 Yuji Hirose)
Uploaded files. 2015-02-08 01:52:26 +00:00			`cpp-peglib`
			`==========`

			`C++11 header-only [PEG](http://en.wikipedia.org/wiki/Parsing_expression_grammar) (Parsing Expression Grammars) library.`

Updated documentation. 2015-02-13 02:08:58 +00:00			cpp-peglib tries to provide more expressive parsing experience in a simple way. This library depends on only one header file. So, you can start using it right away just by including `peglib.h` in your project.
Uploaded files. 2015-02-08 01:52:26 +00:00
Updated documentation. 2015-02-16 01:22:34 +00:00			`The PEG syntax is well described on page 2 in the [document](http://pdos.csail.mit.edu/papers/parsing:popl04.pdf). cpp-peglib also supports the following additional syntax for now:`

Changed namespace/class names. 2015-08-10 20:37:56 +00:00			* `<` ... `>` (Token boundary operator)
Added 'ignore' operator. 2015-02-18 23:00:11 +00:00			* `$<` ... `>` (Capture operator)
Updated README. 2015-03-04 03:08:18 +00:00			* `$name<` ... `>` (Named capture operator)
Added 'ignore' operator. 2015-02-18 23:00:11 +00:00			* `~` (Ignore operator)
Updated README. 2015-03-04 03:08:18 +00:00			* `\x20` (Hex number char)
Uploaded files. 2015-02-08 01:52:26 +00:00
Updated public interface. 2015-03-03 02:52:09 +00:00			`This library also supports the linear-time parsing known as the [Packrat](http://pdos.csail.mit.edu/~baford/packrat/thesis/thesis.pdf) parsing.`
Moved 'choice' property to SemanticValues. 2015-02-27 03:40:00 +00:00
Uploaded files. 2015-02-08 01:52:26 +00:00			`How to use`
			`----------`

Added simple interface. 2015-02-15 22:52:39 +00:00			`This is a simple calculator sample. It shows how to define grammar, associate samantic actions to the grammar and handle semantic values.`
Uploaded files. 2015-02-08 01:52:26 +00:00
			```c++
Added simple interface. 2015-02-15 22:52:39 +00:00			`// (1) Include the header file`
Updated README. 2015-02-12 02:04:08 +00:00			`#include <peglib.h>`
Updated documentation and examples. 2015-03-11 17:53:24 +00:00			`#include <assert.h>`
Updated README. 2015-02-12 02:04:08 +00:00
Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`using namespace peg;`
Updated README. 2015-02-12 02:04:08 +00:00			`using namespace std;`

			`int main(void) {`
Added simple interface. 2015-02-15 22:52:39 +00:00			`// (2) Make a parser`
Updated documentation. 2015-02-13 02:08:58 +00:00			`auto syntax = R"(`
			`# Grammar for Calculator...`
			`Additive <- Multitive '+' Additive / Multitive`
			`Multitive <- Primary '*' Multitive / Primary`
			`Primary <- '(' Additive ')' / Number`
			`Number <- [0-9]+`
			`)";`

Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`parser parser(syntax);`
Updated documentation. 2015-02-13 02:08:58 +00:00
Added simple interface. 2015-02-15 22:52:39 +00:00			`// (3) Setup an action`
Simplefied code. 2015-06-16 03:26:49 +00:00			`parser["Additive"] = [](const SemanticValues& sv) {`
			`switch (sv.choice) {`
			`case 0: // "Multitive '+' Additive"`
			`return sv[0].get<int>() + sv[1].get<int>();`
			`default: // "Multitive"`
			`return sv[0].get<int>();`
			`}`
Updated documentation. 2015-02-13 02:08:58 +00:00			`};`

Moved 'choice' property to SemanticValues. 2015-02-27 03:40:00 +00:00			`parser["Multitive"] = [](const SemanticValues& sv) {`
			`switch (sv.choice) {`
Simplefied code. 2015-06-16 03:26:49 +00:00			`case 0: // "Primary '*' Multitive"`
Name refactoring. 2015-03-09 18:58:43 +00:00			`return sv[0].get<int>() * sv[1].get<int>();`
Simplefied code. 2015-06-16 03:26:49 +00:00			`default: // "Primary"`
Name refactoring. 2015-03-09 18:58:43 +00:00			`return sv[0].get<int>();`
Moved 'choice' property to SemanticValues. 2015-02-27 03:40:00 +00:00			`}`
Updated documentation. 2015-02-13 02:08:58 +00:00			`};`

Simplefiled API. 2015-06-16 04:25:01 +00:00			`parser["Number"] = [](const SemanticValues& sv) {`
Added str() in SemanticValues. 2015-06-16 04:43:08 +00:00			`return stoi(sv.str(), nullptr, 10);`
Updated documentation. 2015-02-13 02:08:58 +00:00			`};`

Added simple interface. 2015-02-15 22:52:39 +00:00			`// (4) Parse`
Simplefied code. 2015-06-16 03:26:49 +00:00			`parser.packrat_parsing(); // Enable packrat parsing.`
Updated documentation and examples. 2015-03-11 17:53:24 +00:00
Updated documentation. 2015-02-13 02:08:58 +00:00			`int val;`
Name refactoring. 2015-03-09 18:58:43 +00:00			`parser.parse("(1+2)*3", val);`
Updated documentation. 2015-02-13 02:08:58 +00:00
Fixed sample. 2015-02-16 03:21:18 +00:00			`assert(val == 9);`
Updated README. 2015-02-12 02:04:08 +00:00			`}`
			```
Uploaded files. 2015-02-08 01:52:26 +00:00
Simplefiled API. 2015-06-16 04:25:01 +00:00			`Here are available actions:`
Added simple interface. 2015-02-15 22:52:39 +00:00
			```c++
Changed the semantic values interface. 2015-02-22 00:38:30 +00:00			`[](const SemanticValues& sv, any& dt)`
			`[](const SemanticValues& sv)`
Added simple interface. 2015-02-15 22:52:39 +00:00			```

Changed the semantic values interface. 2015-02-22 00:38:30 +00:00			`const SemanticValues& sv` contains semantic values. `SemanticValues` structure is defined as follows.
Added 'const SemanticValues&` action. 2015-02-19 03:28:57 +00:00
			```c++
Changed the semantic values interface. 2015-02-22 00:38:30 +00:00			`struct SemanticValue {`
Corrected README. 2015-06-16 05:04:01 +00:00			`any val; // Semantic value`
			`const char* name; // Definition name for the sematic value`
Changed the semantic values interface. 2015-02-22 00:38:30 +00:00			`const char* s; // Token start for the semantic value`
Name refactoring. 2015-03-09 18:58:43 +00:00			`size_t n; // Token length for the semantic value`
Updated documentation. 2015-03-11 18:10:59 +00:00
Added str() in SemanticValues. 2015-06-16 04:43:08 +00:00			`// Cast semantic value`
Updated documentation. 2015-03-11 18:10:59 +00:00			`template <typename T> T& get();`
			`template <typename T> const T& get() const;`
Added str() in SemanticValue. 2015-06-16 17:15:27 +00:00
			`// Get token`
			`std::string str() const;`
Added 'const SemanticValues&` action. 2015-02-19 03:28:57 +00:00			`};`
Changed the semantic values interface. 2015-02-22 00:38:30 +00:00
			`struct SemanticValues : protected std::vector<SemanticValue>`
			`{`
Moved 'choice' property to SemanticValues. 2015-02-27 03:40:00 +00:00			`const char* s; // Token start`
Name refactoring. 2015-03-09 18:58:43 +00:00			`size_t n; // Token length`
Moved 'choice' property to SemanticValues. 2015-02-27 03:40:00 +00:00			`size_t choice; // Choice number (0 based index)`
Updated documentation. 2015-03-11 18:10:59 +00:00
Added str() in SemanticValues. 2015-06-16 04:43:08 +00:00			`// Get token`
			`std::string str() const;`

Simplified API. 2015-06-16 05:01:02 +00:00			`// Transform the semantice value vector to another vector`
Corrected README. 2015-06-16 05:04:01 +00:00			`template <typename T> vector<T> transform(size_t beg = 0, size_t end = -1) const;`
Changed the semantic values interface. 2015-02-22 00:38:30 +00:00			`}`
Added 'const SemanticValues&` action. 2015-02-19 03:28:57 +00:00			```

Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`peg::any` class is very similar to [boost::any](http://www.boost.org/doc/libs/1_57_0/doc/html/any.html). You can obtain a value by castning it to the actual type. In order to determine the actual type, you have to check the return value type of the child action for the semantic value.
Changed the semantic values interface. 2015-02-22 00:38:30 +00:00
Name refactoring. 2015-03-09 18:58:43 +00:00			`const char* s, size_t n` gives a pointer and length of the matched string. This is same as `sv.s` and `sv.n`.
Changed the semantic values interface. 2015-02-22 00:38:30 +00:00
			`any& dt` is a data object which can be used by the user for whatever purposes.

Changed namespace/class names. 2015-08-10 20:37:56 +00:00			The following example uses `<` ... ` >` operators. They are the token boundary operators. Each token boundary operator creates a semantic value that contains `const char*` of the position. It could be useful to eliminate unnecessary characters.
Added 'anchor' support. Removed implecit cast operators from 'any'. 2015-02-16 01:11:02 +00:00
			```c++
			`auto syntax = R"(`
			`ROOT <- _ TOKEN (',' _ TOKEN)*`
			`TOKEN <- < [a-z0-9]+ > _`
			`_ <- [ \t\r\n]*`
			`)";`

			`peg pg(syntax);`

Simplefiled API. 2015-06-16 04:25:01 +00:00			`pg["TOKEN"] = [](const SemanticValues& sv) {`
Changed the capture operator and made the anchor operator. 2015-02-18 03:35:07 +00:00			`// 'token' doesn't include trailing whitespaces`
Added str() in SemanticValues. 2015-06-16 04:43:08 +00:00			`auto token = sv.str();`
Added 'anchor' support. Removed implecit cast operators from 'any'. 2015-02-16 01:11:02 +00:00			`};`

			`auto ret = pg.parse(" token1, token2 ");`
			```

Added 'ignore' operator. 2015-02-18 23:00:11 +00:00			We can ignore unnecessary semantic values from the list by using `~` operator.

			```c++
Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`peg::pegparser parser(`
Added more information about the ignore operator. 2015-06-13 05:27:49 +00:00			`" ROOT <- _ ITEM (',' _ ITEM _)* "`
			`" ITEM <- ([a-z])+ "`
			`" ~_ <- [ \t]* "`
Added 'ignore' operator. 2015-02-18 23:00:11 +00:00			`);`

Changed the semantic values interface. 2015-02-22 00:38:30 +00:00			`parser["ROOT"] = [&](const SemanticValues& sv) {`
			`assert(sv.size() == 2); // should be 2 instead of 5.`
Added 'ignore' operator. 2015-02-18 23:00:11 +00:00			`};`

			`auto ret = parser.parse(" item1, item2 ");`
			```

Added more information about the ignore operator. 2015-06-13 05:27:49 +00:00			`The following grammar is same as the above.`

			```c++
Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`peg::parser parser(`
Added more information about the ignore operator. 2015-06-13 05:27:49 +00:00			`" ROOT <- ~_ ITEM (',' ~_ ITEM ~_)* "`
			`" ITEM <- ([a-z])+ "`
			`" _ <- [ \t]* "`
			`);`
			```

Changed namespace/class names. 2015-08-10 20:37:56 +00:00			Semantic predicate support is available. We can do it by throwing a `peg::parse_error` exception in a semantic action.
Added semantic predicate support. 2015-06-15 20:07:25 +00:00
			```c++
Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`peg::parser parser("NUMBER <- [0-9]+");`
Added semantic predicate support. 2015-06-15 20:07:25 +00:00
Simplefiled API. 2015-06-16 04:25:01 +00:00			`parser["NUMBER"] = [](const SemanticValues& sv) {`
Added str() in SemanticValues. 2015-06-16 04:43:08 +00:00			`auto val = stol(sv.str(), nullptr, 10);`
Added semantic predicate support. 2015-06-15 20:07:25 +00:00			`if (val != 100) {`
Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`throw peg::parse_error("value error!!");`
Added semantic predicate support. 2015-06-15 20:07:25 +00:00			`}`
			`return val;`
			`};`

			`long val;`
			`auto ret = parser.parse("100", val);`
			`assert(ret == true);`
			`assert(val == 100);`

			`ret = parser.parse("200", val);`
			`assert(ret == false);`
			```

Added simple interface. 2015-02-15 22:52:39 +00:00			`Simple interface`
			`----------------`

			`cpp-peglib provides std::regex-like simple interface for trivial tasks.`

Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`peg::peg_match` tries to capture strings in the `$< ... >` operator and store them into `peg::match` object.
Added simple interface. 2015-02-15 22:52:39 +00:00
			```c++
Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`peg::match m;`
Added the named capture explanation in README. 2015-06-13 21:11:27 +00:00
Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`auto ret = peg::peg_match(`
Added simple interface. 2015-02-15 22:52:39 +00:00			`R"(`
Changed the capture operator and made the anchor operator. 2015-02-18 03:35:07 +00:00			`ROOT <- _ ('[' $< TAG_NAME > ']' _)*`
Added simple interface. 2015-02-15 22:52:39 +00:00			`TAG_NAME <- (!']' .)+`
			`_ <- [ \t]*`
			`)",`
			`" [tag1] [tag:2] [tag-3] ",`
			`m);`

			`assert(ret == true);`
			`assert(m.size() == 4);`
			`assert(m.str(1) == "tag1");`
			`assert(m.str(2) == "tag:2");`
			`assert(m.str(3) == "tag-3");`
			```

Added the named capture explanation in README. 2015-06-13 21:11:27 +00:00			It also supports named capture with the `$name<` ... `>` operator.

			```c++
Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`peg::match m;`
Added the named capture explanation in README. 2015-06-13 21:11:27 +00:00
Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`auto ret = peg::peg_match(`
Updated README. 2015-06-14 12:02:59 +00:00			`R"(`
			`ROOT <- _ ('[' $test< TAG_NAME > ']' _)*`
			`TAG_NAME <- (!']' .)+`
			`_ <- [ \t]*`
			`)",`
Added the named capture explanation in README. 2015-06-13 21:11:27 +00:00			`" [tag1] [tag:2] [tag-3] ",`
			`m);`

			`auto cap = m.named_capture("test");`

			`REQUIRE(ret == true);`
			`REQUIRE(m.size() == 4);`
			`REQUIRE(cap.size() == 3);`
			`REQUIRE(m.str(cap[2]) == "tag-3");`
			```

Added simple interface. 2015-02-15 22:52:39 +00:00			`There are some ways to search a peg pattern in a document.`

			```c++
Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`using namespace peg;`
Added simple interface. 2015-02-15 22:52:39 +00:00
			`auto syntax = R"(`
Updated README. 2015-06-15 21:47:19 +00:00			`ROOT <- '[' $< [a-z0-9]+ > ']'`
Added simple interface. 2015-02-15 22:52:39 +00:00			`)";`

			`auto s = " [tag1] [tag2] [tag3] ";`

Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`// peg::peg_search`
			`parser pg(syntax);`
Added simple interface. 2015-02-15 22:52:39 +00:00			`size_t pos = 0;`
Name refactoring. 2015-03-09 18:58:43 +00:00			`auto n = strlen(s);`
Added simple interface. 2015-02-15 22:52:39 +00:00			`match m;`
Name refactoring. 2015-03-09 18:58:43 +00:00			`while (peg_search(pg, s + pos, n - pos, m)) {`
Updated README. 2015-06-15 21:47:19 +00:00			`cout << m.str() << endl; // entire match`
			`cout << m.str(1) << endl; // submatch #1`
			`pos += m.length();`
Added simple interface. 2015-02-15 22:52:39 +00:00			`}`

Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`// peg::peg_token_iterator`
Added simple interface. 2015-02-15 22:52:39 +00:00			`peg_token_iterator it(syntax, s);`
			`while (it != peg_token_iterator()) {`
Updated README. 2015-06-15 21:47:19 +00:00			`cout << it->str() << endl; // entire match`
			`cout << it->str(1) << endl; // submatch #1`
			`++it;`
Added simple interface. 2015-02-15 22:52:39 +00:00			`}`

Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`// peg::peg_token_range`
Added simple interface. 2015-02-15 22:52:39 +00:00			`for (auto& m: peg_token_range(syntax, s)) {`
Updated README. 2015-06-15 21:47:19 +00:00			`cout << m.str() << endl; // entire match`
			`cout << m.str(1) << endl; // submatch #1`
Added simple interface. 2015-02-15 22:52:39 +00:00			`}`
			```

Major refactoring. 2015-02-09 22:12:59 +00:00			`Make a parser with parser operators`
			`-----------------------------------`
Uploaded files. 2015-02-08 01:52:26 +00:00
Major refactoring. 2015-02-09 22:12:59 +00:00			`Instead of makeing a parser by parsing PEG syntax text, we can also construct a parser by hand with parser operators. Here is an example:`
Uploaded files. 2015-02-08 01:52:26 +00:00
			```c++
Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`using namespace peg;`
Uploaded files. 2015-02-08 01:52:26 +00:00			`using namespace std;`

Major refactoring. 2015-02-09 22:12:59 +00:00			`vector<string> tags;`

Corrected documentation. 2015-02-08 14:43:49 +00:00			`Definition ROOT, TAG_NAME, _;`
Fixed documentation. 2015-02-14 15:38:15 +00:00			`ROOT <= seq(_, zom(seq(chr('['), TAG_NAME, chr(']'), _)));`
Simplefiled API. 2015-06-16 04:25:01 +00:00			`TAG_NAME <= oom(seq(npd(chr(']')), dot())), [&](const SemanticValues& sv) {`
Added str() in SemanticValues. 2015-06-16 04:43:08 +00:00			`tags.push_back(sv.str());`
Fixed documentation. 2015-02-14 15:38:15 +00:00			`};`
			`_ <= zom(cls(" \t"));`
Uploaded files. 2015-02-08 01:52:26 +00:00
			`auto ret = ROOT.parse(" [tag1] [tag:2] [tag-3] ");`
			```

			`The following are available operators:`

Updated README. 2015-06-13 05:22:46 +00:00			`\| Operator \| Description \|`
			`\| :------- \| :-------------------- \|`
			`\| seq \| Sequence \|`
			`\| cho \| Prioritized Choice \|`
			`\| zom \| Zero or More \|`
			`\| oom \| One or More \|`
			`\| opt \| Optional \|`
			`\| apd \| And predicate \|`
			`\| npd \| Not predicate \|`
			`\| lit \| Literal string \|`
			`\| cls \| Character class \|`
			`\| chr \| Character \|`
			`\| dot \| Any character \|`
Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`\| tok \| Token boundary \|`
Fixed typo. 2015-06-13 05:23:27 +00:00			`\| ign \| Ignore semantic value \|`
Updated README. 2015-06-13 05:22:46 +00:00			`\| cap \| Capture character \|`
			`\| usr \| User defiend parser \|`
Added 'usr' operator. 2015-02-20 03:27:47 +00:00
Modified documentation and the calc sample. 2015-02-20 03:51:04 +00:00			`Adjust definitions`
			`------------------`
Added 'usr' operator. 2015-02-20 03:27:47 +00:00
Updated README. 2015-06-15 20:05:36 +00:00			`It's possible to add/override definitions.`
Added 'usr' operator. 2015-02-20 03:27:47 +00:00
			```c++
			`auto syntax = R"(`
			`ROOT <- _ 'Hello' _ NAME '!' _`
			`)";`

Updated README. 2015-06-15 20:05:36 +00:00			`Rules additional_rules = {`
Added 'usr' operator. 2015-02-20 03:27:47 +00:00			`{`
Fixed User rule problem. 2015-06-15 17:47:59 +00:00			`"NAME", usr([](const char* s, size_t n, SemanticValues& sv, any& c) -> size_t {`
Added 'usr' operator. 2015-02-20 03:27:47 +00:00			`static vector<string> names = { "PEG", "BNF" };`
Fixed User rule problem. 2015-06-15 17:47:59 +00:00			`for (const auto& name: names) {`
			`if (name.size() <= n && !name.compare(0, name.size(), s, name.size())) {`
Updated README. 2015-06-15 21:47:19 +00:00			`return name.size(); // processed length`
Added 'usr' operator. 2015-02-20 03:27:47 +00:00			`}`
			`}`
Updated README. 2015-06-15 21:47:19 +00:00			`return -1; // parse error`
Added 'usr' operator. 2015-02-20 03:27:47 +00:00			`})`
			`},`
			`{`
			`"~_", zom(cls(" \t\r\n"))`
			`}`
			`};`

Changed namespace/class names. 2015-08-10 20:37:56 +00:00			`auto g = parser(syntax, additional_rules);`
Added 'usr' operator. 2015-02-20 03:27:47 +00:00
			`assert(g.parse(" Hello BNF! "));`
			```
Uploaded files. 2015-02-08 01:52:26 +00:00
Corrected README. 2015-02-08 01:58:25 +00:00			`Sample codes`
			`------------`

			`* [Calculator](https://github.com/yhirose/cpp-peglib/blob/master/example/calc.cc)`
Updated documentation. 2015-02-22 04:23:59 +00:00			`* [Calculator (with parser operators)](https://github.com/yhirose/cpp-peglib/blob/master/example/calc2.cc)`
			`* [Calculator (AST version)](https://github.com/yhirose/cpp-peglib/blob/master/example/calc3.cc)`
Updated README. 2015-08-06 11:56:31 +00:00			`* [PEG syntax Lint utility](https://github.com/yhirose/cpp-peglib/blob/master/lint/cmdline/peglint.cc)`
			`* [PL/0 Interpreter](https://github.com/yhirose/cpp-peglib/blob/master/language/pl0/pl0.cc)`
Corrected README. 2015-02-08 01:58:25 +00:00
Uploaded files. 2015-02-08 01:52:26 +00:00			`Tested Compilers`
			`----------------`

Updated README. 2015-08-04 22:10:53 +00:00			`* Visual Studio 2015`
Uploaded files. 2015-02-08 01:52:26 +00:00			`* Clang 3.5`

			`TODO`
			`----`

			`* Unicode support`

			`License`
			`-------`

			`MIT license (© 2015 Yuji Hirose)`