RDF 1.2 TriG

Abstract

This document defines a textual syntax for RDF called TriG that allows an RDF dataset to be completely written in a compact and natural text form, with abbreviations for common usage patterns and datatypes. TriG is an extension of the Turtle [ RDF12-TURTLE ] format.

RDF 1.2 TriG shares triple terms with [ RDF12-TURTLE ] as a fourth kind of RDF term which can be used as the object of another triple , making it possible to make statements about other statements. RDF 1.2 TriG also adds shares directional language-tagged strings with [ RDF12-TURTLE ].

Status of This Document

This is a preview

Do not attempt to implement this version of the specification. Do not reference this version as authoritative in any way. Instead, see https://w3c.github.io/rdf-trig/spec/ for the Editor's draft.

This section describes the status of this document at the time of its publication. A list of current W3C publications and the latest revision of this technical report can be found in the W3C standards and drafts index at https://www.w3.org/TR/.

This document is part of the RDF 1.2 document suite. TriG is intended the meet the charter requirement of the RDF Working Group to define an RDF syntax for multiple graphs. TriG is an extension of the Turtle syntax for RDF [ RDF12-TURTLE ]. The current document is based on the original proposal by Chris Bizer and Richard Cyganiak.

This document was published by the RDF & SPARQL Working Group as an Editor's Draft.

Publication as an Editor's Draft does not imply endorsement by W3C and its Members.

This is a draft document and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress. Future updates to this specification may incorporate new features .

This document was produced by a group operating under the W3C Patent Policy . W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy .

# This document encodes one graph. PREFIX ex: <http://www.example.org/vocabulary#> PREFIX : <http://www.example.org/exampleDocument#> :G1 { :Monica a ex:Person ; ex:name "Monica Murphy" ; ex:homepage <http://www.monicamurphy.org> ; ex:email <mailto:monica@monicamurphy.org> ; ex:hasSkill ex:Management , ex:Programming . }

# This document contains a default graph and two named graphs. PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX dc: <http://purl.org/dc/terms/> PREFIX foaf: <http://xmlns.com/foaf/0.1/> # default graph { <http://example.org/bob> dc:publisher "Bob" . <http://example.org/alice> dc:publisher "Alice" . } <http://example.org/bob> { _:a foaf:name "Bob" . _:a foaf:mbox <mailto:bob@oldcorp.example.org> . _:a foaf:knows _:b . } <http://example.org/alice> { _:b foaf:name "Alice" . _:b foaf:mbox <mailto:alice@work.example.org> . }

# This document contains a same data as the previous example. PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX dc: <http://purl.org/dc/terms/> PREFIX foaf: <http://xmlns.com/foaf/0.1/> # default graph - no {} used. <http://example.org/bob> dc:publisher "Bob" . <http://example.org/alice> dc:publisher "Alice" . # GRAPH keyword to highlight a named graph # Abbreviation of triples using ; GRAPH <http://example.org/bob> { [] foaf:name "Bob" ; foaf:mbox <mailto:bob@oldcorp.example.org> ; foaf:knows _:b . } GRAPH <http://example.org/alice> { _:b foaf:name "Alice" ; foaf:mbox <mailto:alice@work.example.org> }

VERSION "1.2" PREFIX : <http://www.example.org/> PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> _:e38 :familyName "Smith" . _:anno rdf:reifies <<( _:e38 :jobTitle "Designer" )>> . _:anno :accordingTo _:e22 .

A TriG document is ~~a Unicode~~ an RDF string encoded in UTF-8 [ ~~UNICODE~~ RFC3629 ~~] character string encoded in UTF-8.~~ ]. Only Unicode ~~characters only~~ scalar values , in the ~~range~~ ranges U+0000 to U+D7FF and U+E000 to U+10FFFF inclusive, are allowed. This excludes surrogate code points , range U+D800 to U+DFFF.

White space (production WS ) is used to separate two terminals which would otherwise be (mis-)recognized as one terminal. Rule names below in capitals indicate where white space is significant; these form a possible choice of terminals for constructing a TriG parser .

White space is significant in the production String .

Comments in TriG start with a # outside an IRIREF , STRING_LITERAL_SINGLE_QUOTE , STRING_LITERAL_QUOTE , STRING_LITERAL_LONG_SINGLE_QUOTE , or STRING_LITERAL_LONG_QUOTE , and continue to the end of line (marked by LF , or CR ), or end of file if there is no end of line after the comment marker. Comments are treated as white space.

Relative IRI references are resolved with base IRIs as per Uniform Resource Identifier (URI): Generic Syntax [ RFC3986 ] using only the basic algorithm in section 5.2. Neither Syntax-Based Normalization nor Scheme-Based Normalization (described in sections 6.2.2 and 6.2.3 of RFC3986) are performed. Characters additionally allowed in IRI references are treated in the same way that unreserved characters are treated in URI references, per section 6.5 of Internationalized Resource Identifiers (IRIs) [ RFC3987 ].

The @base or BASE directive defines the Base IRI used to resolve relative IRI references per [ RFC3986 ] section 5.1.1, "Base URI Embedded in Content" . Section 5.1.2, "Base URI from the Encapsulating Entity" defines how the In-Scope Base IRI may come from an encapsulating document, such as a SOAP envelope with an xml:base directive or a MIME multipart document with a Content-Location header. The "Retrieval URI" identified in 5.1.3, Base "URI from the Retrieval URI" , is the URL from which a particular TriG document was retrieved. If none of the above specifies the Base URI, the default Base URI ( section 5.1.4, "Default Base URI" ) is used. Each @base or BASE directive sets a new In-Scope Base URI, relative to the previous one.

There are three forms of escapes used in TriG ~~documents:~~ document s:

A numeric escape ~~sequences represent~~ sequence represents the value of a Unicode code ~~points:~~ point .

A numeric escape sequence MUST NOT produce a code point value in the range U+D800 to U+DFFF, which is the range for Unicode surrogates .

Escape sequence	Unicode code point
`\u` `hex` `hex` `hex` `hex`	A Unicode code point in the ~~range~~ ranges `U+0000` to `U+FFFF U+D7FF` ~~inclusive~~ and `U+E000` to `U+D7FF`, corresponding to the value encoded by the four hexadecimal digits interpreted from most significant to least significant digit.
`\U` `hex` `hex` `hex` `hex` `hex` `hex` `hex` `hex`	A Unicode code point in the ~~range~~ ranges `U+0000` to `U+0010FFFF U+D7FF` ~~inclusive~~ and `U+E000` to `U+10FFFF`, corresponding to the value encoded by the eight hexadecimal digits interpreted from most significant to least significant digit.

where hex is a hexadecimal character

HEX ::= [0-9] | [A-F] | [a-f]

A string escape ~~sequences represent the characters~~ sequence represents a character traditionally escaped in string literals:

Escape sequence	Unicode code point
`\t`	`U+0009`
`\b`	`U+0008`
`\n`	`U+000A`
`\r`	`U+000D`
`\f`	`U+000C`
`\"`	`U+0022`
`\'`	`U+0027`
`\\`	`U+005C`

A reserved character escape ~~sequences consist~~ sequence consists of a \ followed by one of these characters ~.-!$&'()*+,;=/?#@%_, and ~~represent~~ represents the character to the right of the \ .

Context where each kind of escape sequence can be used
	numeric escapes	string escapes	reserved character escapes
IRI s, used as RDF terms or as in `PREFIX` or `BASE` declarations	yes	no	no
local name s	no	no	yes
String s	yes	yes	no

Note

%-encoded sequences are in the character range for IRIs and are explicitly allowed in local names. These appear as a '%' followed by two hex characters and represent that same sequence of three characters. These sequences are not decoded during processing. A term written as <http://a.example/%66oo-bar> in TriG designates the IRI http://a.example/%66oo-bar and not IRI http://a.example/foo-bar. A term written as ex:%66oo-bar with a prefix PREFIX ex: <http://a.example/> also designates the IRI http://a.example/%66oo-bar.

The EBNF used here is defined in XML 1.0 [ EBNF-NOTATION ].

Notes:

A blank node identifier represents the same blank node throughout the TriG ~~document.~~ document .
Keywords in single quotes (' @base ', ' @prefix ', ' @version ', ' a ', ' true ', and ' false ') are case-sensitive. Keywords in quotation marks (" BASE ", " PREFIX "), and " VERSION ") are case-insensitive.
Escape sequences UCHAR and ECHAR are case sensitive.
When tokenizing the input and choosing grammar rules, the longest match is chosen.
The TriG grammar is LL(1) and LALR(1) when the rules with uppercased names are used as terminals.
The entry point into the grammar is trigDoc .
In signed numbers, no white space is allowed between the sign and the number.
The strings ' @prefix ', ' @base ', and ' @version ' match the pattern for LANG_DIR , though neither prefix, base, nor version are registered language subtags . This specification does not define whether a quoted literal followed by either of these tokens (e.g., "A"@base ) is in the Turtle language.

[1]	`trigDoc`	::=	`(` directive `\|` block `)` `*`
[2]	`block`	::=	triplesOrGraph `\|` wrappedGraph `\|` triples2 `\|` `(` " `GRAPH` " labelOrSubject wrappedGraph `)`
[3]	`triplesOrGraph`	::=	`(` labelOrSubject `(` wrappedGraph `\|` `(` predicateObjectList ' `.` ' `)` `)` `)` `\|` `(` reifiedTriple predicateObjectList `?` ' `.` ' `)`
[4]	`triples2`	::=	`(` blankNodePropertyList predicateObjectList `?` ' `.` ' `)` `\|` `(` collection predicateObjectList ' `.` ' `)`
[5]	`wrappedGraph`	::=	' `{` ' triplesBlock `?` ' `}` '
[6]	`triplesBlock`	::=	triples `(` ' `.` ' triplesBlock `?` `)` `?`
[7]	`labelOrSubject`	::=	iri `\|` BlankNode
[8]	`directive`	::=	prefixID `\|` base `\|` version `\|` sparqlPrefix `\|` sparqlBase `\|` sparqlVersion
[9]	`prefixID`	::=	' `@prefix` ' PNAME_NS IRIREF ' `.` '
[10]	`base`	::=	' `@base` ' IRIREF ' `.` '
[11]	`version`	::=	' `@version` ' VersionSpecifier ' `.` '
[12]	`sparqlPrefix`	::=	" `PREFIX` " PNAME_NS IRIREF
[13]	`sparqlBase`	::=	" `BASE` " IRIREF
[14]	`sparqlVersion`	::=	" `VERSION` " VersionSpecifier
[15]	`VersionSpecifier`	::=	STRING_LITERAL_QUOTE `\|` STRING_LITERAL_SINGLE_QUOTE
[16]	`triples`	::=	`(` subject predicateObjectList `)` `\|` `(` blankNodePropertyList predicateObjectList `?` `)` `\|` `(` reifiedTriple predicateObjectList `?` `)`
[17]	`predicateObjectList`	::=	verb objectList `(` ' `;` ' `(` verb objectList `)` `?` `)` `*`
[18]	`objectList`	::=	object annotation `(` ' `,` ' object annotation `)` `*`
[19]	`verb`	::=	predicate `\|` ' `a` '
[20]	`subject`	::=	iri `\|` BlankNode `\|` collection
[21]	`predicate`	::=	iri
[22]	`object`	::=	iri `\|` BlankNode `\|` collection `\|` blankNodePropertyList `\|` literal `\|` tripleTerm `\|` reifiedTriple
[23]	`literal`	::=	RDFLiteral `\|` NumericLiteral `\|` BooleanLiteral
[24]	`blankNodePropertyList`	::=	' `[` ' predicateObjectList ' `]` '
[25]	`collection`	::=	' `(` ' object `*` ' `)` '
[26]	`NumericLiteral`	::=	INTEGER `\|` DECIMAL `\|` DOUBLE
[27]	`RDFLiteral`	::=	String `(` LANG_DIR `\|` `(` ' `^^` ' iri `)` `)` `?`
[28]	`BooleanLiteral`	::=	' `true` ' `\|` ' `false` '
[29]	`String`	::=	STRING_LITERAL_QUOTE `\|` STRING_LITERAL_SINGLE_QUOTE `\|` STRING_LITERAL_LONG_SINGLE_QUOTE `\|` STRING_LITERAL_LONG_QUOTE
[30]	`iri`	::=	IRIREF `\|` PrefixedName
[31]	`PrefixedName`	::=	PNAME_LN `\|` PNAME_NS
[32]	`BlankNode`	::=	BLANK_NODE_LABEL `\|` ANON
[33]	`reifier`	::=	' `~` ' `(` iri `\|` BlankNode `)` `?`
[34]	`reifiedTriple`	::=	' `<<` ' rtSubject verb rtObject reifier `?` ' `>>` '
[35]	`rtSubject`	::=	iri `\|` BlankNode `\|` reifiedTriple
[36]	`rtObject`	::=	iri `\|` BlankNode `\|` literal `\|` tripleTerm `\|` reifiedTriple
[37]	`tripleTerm`	::=	' `<<(` ' ttSubject verb ttObject ' `)>>` '
[38]	`ttSubject`	::=	iri `\|` BlankNode
[39]	`ttObject`	::=	iri `\|` BlankNode `\|` literal `\|` tripleTerm
[40]	`annotation`	::=	`(` reifier `\|` annotationBlock `)` `*`
[41]	`annotationBlock`	::=	' `{\|` ' predicateObjectList ' `\|}` '
Productions for terminals
[43]	`IRIREF`	::=	' `<` ' `(` `[` `^` `#x00` `-` `#x20` <>"{}\|^`\ `]` `\|` UCHAR `)` `` ' `>` ' / #x00=NULL #x01-#x1F=control codes #x20=space */
[44]	`PNAME_NS`	::=	PN_PREFIX `?` ' `:` '
[45]	`PNAME_LN`	::=	PNAME_NS PN_LOCAL
[46]	`BLANK_NODE_LABEL`	::=	' `_:` ' `(` PN_CHARS_U `\|` `[` `0-9` `]` `)` `(` `(` PN_CHARS `\|` ' `.` ' `)` `*` PN_CHARS `)` `?`
[47]	`LANG_DIR`	::=	' `@` ' `[` `a-zA-Z` `]` `+` `(` ' `-` ' `[` `a-zA-Z0-9` `]` `+` `)` `*` `(` ' `--` ' `[` `a-zA-Z` `]` `+` `)` `?`
[48]	`INTEGER`	::=	`[` `+-` `]` `?` `[` `0-9` `]` `+`
[49]	`DECIMAL`	::=	`[` `+-` `]` `?` `(` `[` `0-9` `]` `*` ' `.` ' `[` `0-9` `]` `+` `)`
[50]	`DOUBLE`	::=	`[` `+-` `]` `?` `(` `(` `[` `0-9` `]` `+` ' `.` ' `[` `0-9` `]` `*` EXPONENT `)` `\|` `(` ' `.` ' `[` `0-9` `]` `+` EXPONENT `)` `\|` `(` `[` `0-9` `]` `+` EXPONENT `)` `)`
[51]	`EXPONENT`	::=	`[` `eE` `]` `[` `+-` `]` `?` `[` `0-9` `]` `+`
[52]	`STRING_LITERAL_QUOTE`	::=	' `"` ' `(` `[` `^` `#x22` `#x5C` `#x0A` `#x0D` `]` `\|` ECHAR `\|` UCHAR `)` `*` ' `"` '
[53]	`STRING_LITERAL_SINGLE_QUOTE`	::=	" `'` " `(` `[` `^` `#x27` `#x5C` `#x0A` `#x0D` `]` `\|` ECHAR `\|` UCHAR `)` `*` " `'` "
[54]	`STRING_LITERAL_LONG_SINGLE_QUOTE`	::=	" `'''` " `(` `(` " `'` " `\|` " `''` " `)` `?` `(` `[` `^'\` `]` `\|` ECHAR `\|` UCHAR `)` `)` `*` " `'''` "
[55]	`STRING_LITERAL_LONG_QUOTE`	::=	' `"""` ' `(` `(` ' `"` ' `\|` ' `""` ' `)` `?` `(` `[` `^"\` `]` `\|` ECHAR `\|` UCHAR `)` `)` `*` ' `"""` '
[56]	`UCHAR`	::=	`(` ' `\u` ' HEX HEX HEX HEX `)` `\|` `(` ' `\U` ' HEX HEX HEX HEX HEX HEX HEX HEX `)`
[57]	`ECHAR`	::=	' `\` ' `[` `tbnrf\"'` `]`
[58]	`WS`	::=	`#x20` `\|` `#x09` `\|` `#x0D` `\|` `#x0A`
[59]	`ANON`	::=	' `[` ' WS `*` ' `]` '
[60]	`PN_CHARS_BASE`	::=	`[` `A-Z` `]`
		`\|`	`[` `a-z` `]`
		`\|`	`[` `#xC0` `-` `#xD6` `]`
		`\|`	`[` `#xD8` `-` `#xF6` `]`
		`\|`	`[` `#xF8` `-` `#x02FF` `]`
		`\|`	`[` `#x0370` `-` `#x037D` `]`
		`\|`	`[` `#x037F` `-` `#x1FFF` `]`
		`\|`	`[` `#x200C` `-` `#x200D` `]`
		`\|`	`[` `#x2070` `-` `#x218F` `]`
		`\|`	`[` `#x2C00` `-` `#x2FEF` `]`
		`\|`	`[` `#x3001` `-` `#xD7FF` `]`
		`\|`	`[` `#xF900` `-` `#xFDCF` `]`
		`\|`	`[` `#xFDF0` `-` `#xFFFD` `]`
		`\|`	`[` `#x00010000` `-` `#x000EFFFF` `]`
[61]	`PN_CHARS_U`	::=	PN_CHARS_BASE `\|` ' `_` '
[62]	`PN_CHARS`	::=	PN_CHARS_U `\|` ' `-` ' `\|` `[` `0-9` `]` `\|` `#xB7` `\|` `[` `#x0300` `-` `#x036F` `]` `\|` `[` `#x203F` `-` `#x2040` `]`
[63]	`PN_PREFIX`	::=	PN_CHARS_BASE `(` `(` PN_CHARS `\|` ' `.` ' `)` `*` PN_CHARS `)` `?`
[64]	`PN_LOCAL`	::=	`(` PN_CHARS_U `\|` ' `:` ' `\|` `[` `0-9` `]` `\|` PLX `)` `(` `(` PN_CHARS `\|` ' `.` ' `\|` ' `:` ' `\|` PLX `)` `*` `(` PN_CHARS `\|` ' `:` ' `\|` PLX `)` `)` `?`
[65]	`PLX`	::=	PERCENT `\|` PN_LOCAL_ESC
[66]	`PERCENT`	::=	' `%` ' HEX HEX
[67]	`HEX`	::=	`[` `0-9` `]` `\|` `[` `A-F` `]` `\|` `[` `a-f` `]`
[68]	`PN_LOCAL_ESC`	::=	' `\` ' `(` ' `_` ' `\|` ' `~` ' `\|` ' `.` ' `\|` ' `-` ' `\|` " `!` " `\|` ' `$` ' `\|` ' `&` ' `\|` " `'` " `\|` ' `(` ' `\|` ' `)` ' `\|` ' `*` ' `\|` ' `+` ' `\|` ' `,` ' `\|` ' `;` ' `\|` ' `=` ' `\|` ' `/` ' `\|` ' `?` ' `\|` ' `#` ' `\|` ' `@` ' `\|` ' `%` ' `)`

A text version of this grammar is available here .

This document uses some specific terminal literal strings [ EBNF-NOTATION ]. To clarify the Unicode code points used for these terminal literal strings, the following table describes specific characters and sequences used throughout this document.

Code	Glyph	Description
`U+000A`	`LF`	Line feed
`U+000D`	`CR`	Carriage return
`U+0022`	`"`	Quotation mark
`U+0023`	`#`	Number sign
`U+0027`	`'`	Apostrophe
`U+002D`	`-`	Hyphen
`U+003B`	`:`	Colon
`U+0040`	`@`	At sign
`U+005C`	`\`	Backslash
`U+005F`	`_`	Underscore
`U+0061`	`a`	Latin small letter E
`U+007B`	`{`	Left curly bracket
`U+007D`	`}`	Right curly bracket

Other short terminal literal strings are composed of specific sequences of Unicode characters:

space: U+0020
""": three concatenated quotation mark characters, each having the code point U+0022
''': three concatenated apostrophe characters, each having the code point U+0027
--: two concatenated - characters

The RDF Concepts and Abstract Syntax [ RDF12-CONCEPTS ] specification defines four types of RDF term : IRIs , literals , blank nodes , and triple terms . Literals are composed of a lexical form and an optional language tag [ BCP47 ] – possibly including an initial text direction – or an optional datatype IRI . An extra type, prefix, is used during parsing to map string identifiers to namespace IRIs. This section maps a string conforming to the grammar in 4.5 Grammar to a set of triples by mapping strings matching productions and lexical tokens to RDF terms or their components (e.g., language tags, lexical forms of literals). Grammar productions change the parser state and emit triples.

Parsing TriG requires a state of ten items:

IRI baseURI — When the base production is reached, the second rule argument, IRIREF , is the base URI used for relative IRI reference resolution.
Map[ prefix -> IRI] namespaces — The second and third rule arguments ( PNAME_NS and IRIREF ) in the prefixID production assign a namespace name ( IRIREF ) for the prefix ( PNAME_NS ). Outside of a prefixID production, any PNAME_NS is substituted with the namespace. Note that the prefix may be an empty string, per the PNAME_NS production: PN_PREFIX? ":".
~~Map[string~~ Map[ string -> blank node ] bnodeLabels — A mapping from a string to a blank node.
RDF Term curSubject — The curSubject is bound to the subject , rtSubject , ttSubject , blankNodePropertyList , collection , tripleTerm , and annotationBlock productions.
RDF_Term curPredicate — The curPredicate is bound to the the verb production. If token matched was a , curPredicate is bound to the IRI http://www.w3.org/1999/02/22-rdf-syntax-ns#type.
RDF Term curObject — The curObject is bound to the object , rtObject , and ttObject productions.
RDF Term curReifier — The curReifier is bound to the reifier and annotationBlock productions.
Triple term curTripleTerm — The curTripleTerm is set in the Annotations constructor.
RDF_Term curGraph — The curGraph is bound to the label of the graph that is the destination of triples produced in parsing. When undefined, triples are destined for the default graph .
xsd:string curVersion – The RDF version used for parsing the document into Triples . If specified as part of a Media Type , the default value for curVersion is taken from the version parameter. The only defined value is "1.2". Parser behavior for other version values is undefined. This specification does not define behavior if different versions are provided for curVersion.

Term Constructors can create a stack of these values indicated by using language such as "records the curSubject and curPredicate."

This table maps productions and lexical tokens to RDF terms or components of RDF terms listed in 5. Parsing :

production	type	procedure
IRIREF	IRI	The characters between "<" and ">" are taken, after the numeric escape sequences are processed, to form the Unicode string of the IRI. Relative IRI reference resolution is performed per 4.3 IRI References .
PNAME_NS	prefix	When used in a `prefixID` or `sparqlPrefix` production, the `prefix` is the potentially empty Unicode string matching the first argument of the rule is a key into the namespaces map .
PNAME_NS	IRI	When used in a `PrefixedName` production, the `iri` is the value in the namespaces map corresponding to the first argument of the rule.
PNAME_LN	IRI	A potentially empty prefix is identified by the first sequence, `PNAME_NS` . The namespaces map MUST have a corresponding `namespace`. The Unicode string of the IRI is formed by unescaping the reserved characters in the second argument, `PN_LOCAL` , and concatenating this onto the `namespace`.
VersionSpecifier	literal	The `curVersion` is taken from a literal using the matched RDF string lexical form and `xsd:string` datatype.
STRING_LITERAL_SINGLE_QUOTE	lexical form	The characters between the outermost `'` s are taken, with numeric and string escape sequences unescaped, to form the Unicode string of a lexical form.
STRING_LITERAL_QUOTE	lexical form	The characters between the outermost `"` s are taken, with numeric and string escape sequences unescaped, to form the Unicode string of a lexical form.
STRING_LITERAL_LONG_SINGLE_QUOTE	lexical form	The characters between the outermost `'''` s are taken, with numeric and string escape sequences unescaped, to form the unicode string of a lexical form.
STRING_LITERAL_LONG_QUOTE.	lexical form	The characters between the outermost `"""` s are taken, with numeric and string escape sequences unescaped, to form the Unicode string of a lexical form.
LANG_DIR	language tag	The characters following the `@` form the language tag and optionally the initial text direction , if the matched characters include `--` .
RDFLiteral	literal	The literal has a lexical form of the first rule argument, `String` . If the `'^^' iri` rule is matched, the datatype IRI is derived from the `iri`, and the literal has no language tag. If the `LANG_DIR` rule is matched, the language tag and initial text direction are taken from `LANG_DIR` . If there is no initial text direction , the datatype is `rdf:langString`. If there is an initial text direction , the datatype is `rdf:dirLangString`. If neither matched, the datatype is `xsd:string`, and the literal has no language tag.
INTEGER	literal	The literal has a lexical form of the input string, and a datatype of `xsd:integer`.
DECIMAL	literal	The literal has a lexical form of the input string, and a datatype of `xsd:decimal`.
DOUBLE	literal	The literal has a lexical form of the input string, and a datatype of `xsd:double`.
BooleanLiteral	literal	The literal has a lexical form of the `true` or `false`, depending on which matched the input, and a datatype of `xsd:boolean`.
BLANK_NODE_LABEL	blank node	The string matching the second argument, `PN_LOCAL` , is a key in bnodeLabels . If there is no corresponding blank node in the map, one is allocated.
ANON	blank node	A blank node is generated.
blankNodePropertyList	blank node	A blank node is generated. Note the rules for `blankNodePropertyList` in the next section.
collection	blank node	For non-empty lists, a blank node is generated. Note the rules for `collection` in the next section.
collection	IRI	For empty lists, the resulting IRI is `rdf:nil`. Note the rules for `collection` in the next section.
reifier	IRI \| blank node	The `curReifier` is taken from term , which is taken from the matched `iri` production or `BlankNode` production, if any. If no such production is matched, term is taken from a fresh RDF blank node .
tripleTerm	triple term	The triple term is composed of the terms constructed from the `ttSubject` , `predicate` , and `ttObject` productions.
reifiedTriple	IRI \| blank node	The term is taken from the matched `reifier` , if any, or from a fresh RDF blank node .
annotationBlock	IRI \| blank node	The term is taken from a previously matched reifier , if any, or from a fresh RDF blank node .

A TriG document defines an RDF Dataset composed of one default graph and zero or more named graphs . Each graph is composed of a set of RDF triples .

The state curGraph is initially unset. It records the label of the graph for triples produced during parsing. If undefined, the default graph is used.

The rule labelOrSubject sets both curGraph and curSubject (only one of these will be used).

The following grammar production clauses set curGraph to be undefined, indicating the default graph:

The grammar production clause wrappedGraph in rule block
The grammar production in rule triples2 .

The grammar production labelOrSubject predicateObjectList '.' unsets curGraph before handling the predicateObjectList production in rule triplesOrGraph .

Each RDF triple produced is added to curGraph, or the default graph if curGraph is not set at that point in the parsing process.

The subject production sets the curSubject. The verb production sets the curPredicate.

Triples are produced at the following points in the parsing process and each RDF triple produced is added to the graph identified by curGraph.

Each object N in the document produces an RDF triple: curSubject curPredicate N.

Beginning the reifier production, the curReifier is taken from the reifier term constructor. Then yield the the RDF triple curReifier rdf:reifies curTripleTerm.

Beginning the reifiedTriple production records the curTripleTerm. A new tripleTerm instance curTripleTerm is created using the rtSubject , verb , and rtObject productions. Finishing the reifiedTriple production, if the curReifier is not set, it is assigned a fresh RDF blank node ; it next yields the RDF triple curReifier rdf:reifies curTripleTerm, and then restores the recorded value of the curTripleTerm. The node produced by matching reifiedTriple is the the curReifier.

Beginning the annotation production records the curSubject and curPredicate. A new tripleTerm instance curTripleTerm is created using the curSubject curPredicate curObject, and the value of the curReifier is cleared. Finishing the annotation production restores the recorded values of the curSubject and curPredicate.

Beginning the annotationBlock production records the curTripleTerm. If the curReifier is not set, then it is assigned a fresh RDF blank node and the production yields the RDF triple curReifier rdf:reifies curTripleTerm. The curSubject is taken from the curReifier Finishing the annotationBlock production clears the value of the curReifier and restores the curTripleTerm.

Note

If the curReifier was already set, the reifying triple curReifier rdf:reifies curTripleTerm was emitted in 5.3.4 Reifiers .

Beginning the blankNodePropertyList production records the curSubject and curPredicate, and sets curSubject to a novel blank node B. Finishing the blankNodePropertyList production restores curSubject and curPredicate. The node produced by matching blankNodePropertyList is the blank node B.

Beginning the collection production records the curSubject and curPredicate. Each object in the collection production has a curSubject set to a novel blank node B and a curPredicate set to rdf:first. For each object object _n after the first produces a triple: object _n-1 rdf:rest object _n . Finishing the collection production creates an additional triple curSubject rdf:rest rdf:nil . and restores curSubject and curPredicate The node produced by matching collection is the first blank node B for non-empty lists and rdf:nil for empty lists.

RDF 1.2 TriG

RDF Dataset Language

Abstract

Status of This Document

Set of Documents

1. Introduction

2. TriG Language

2.1 Version Announcement

2.2 Triple Statements

2.3 Graph Statements

2.4 Triple Terms

2.5 Other Terms

2.5.1 Special Considerations for Blank Nodes

3. Conformance

3.1 Media Type and Content Encoding

4. TriG Grammar

4.1 White Space

4.2 Comments

4.3 IRI References

4.4 Escape Sequences

4.5 Grammar

Productions for terminals

4.6 Selected Terminal Literal Strings

5. Parsing

5.1 Parser State

5.2 RDF Term Constructors

5.3 RDF Triples Construction

5.3.1 Output Graph

5.3.2 Triple Output

5.3.3 Triple Production

5.3.4 Reifiers

5.3.5 Reified Triples

5.3.6 Annotations

5.3.7 Annotation Blocks

5.3.8 Property Lists

5.3.9 Collections

A. Privacy Considerations

B. Security Considerations

C. Internet Media Type, File Extension and Macintosh File Type

D. Acknowledgments

D.1 Acknowledgments for RDF 1.1

D.2 Acknowledgments for RDF 1.2

E. Changes between RDF 1.1 and RDF 1.2

F. Index

F.1 Terms defined by this specification

F.2 Terms defined by reference

G. References

G.1 Normative references

G.2 Informative references