A bnd instruction is a property that starts with a minus sign (‘-‘). An instruction instructs bndlib to do something, in general providing parameters to the code. All instructions in bndlib are listed later in this page.
Almost all bndlib instructions follow the general OSGi syntax. However, the bndlib syntax is in general a bit more relaxed, for example you can use either single quotes (‘’’) or double quotes (‘”’) while OSGi only allows double quotes. Dangling comma’s and some other not exactly correct headers are accepted by bndlib without complaining. Values without quotes are accepted as long as they cannot confuse the general syntax. For example, a value like 1.20 does not have to be quoted since the period (‘.’) cannot confuse the parser on that place, however, a value like [1.3,4) must be quoted since it contains a comma.
In this mannual we adhere to the same conventions as the OSGi specification. This is mostly reflected in the names for the terminals in the syntax specifications. As a reminder we repeat the syntax rules and common terminals:
*
– Repetition of the previous element zero or more times, e.g. ( ',' element ) *
+
– Repetition one or more times?
– Previous element is optional( ... )
– Grouping|
– Or[...]
– Set (one of)..
– Range<...>
– Externally defined token, followed with a description~
– Not, negationThe following terminals are pre-defined:
ws ::= <see Character.isWhitespace>
digit ::= [0..9]
alpha ::= [a..zA..Z]
alphanum ::= alpha | digit
token ::= ( alphanum | '_' | '-' )+
number ::= digit+
jletter ::= <see [JSL][1] for JavaLetter>
jletterordigit ::= <See [JLS][1] for JavaLetterOrDigit >
qname ::= <See [JLS][1] for fully qualified class name >
identifier ::= jletter jletterordigit *
extended ::= ( alphanum | '_' | '-' | '.' )+
quoted-string-d ::= '"' ( ~["\#x0D#x0A#x00] | '\"'|'\')* '"'
quoted-string-s ::= ''' ( ~['\#x0D#x0A#x00] | '\''|'\')* '''
quoted-string ::= quoted-string-s | quoted-string-d
special-chars ::= ~["\#x0D#x0A#x00:=;,<See [JLS][1] for whitespace>]
path ::= special-chars+ | quoted-string
value ::= quoted-string | path
argument ::= extended | quoted-string
parameter ::= directive | attribute
directive ::= extended ':=' argument
attribute ::= extended '=' argument
parameters ::= parameter ( ',' parameter ) *
unique-name ::= identifier ( '.' identifier )*
symbolic-name ::= token('.'token)*
package-name ::= unique-name
version ::= major( '.' minor ( '.' micro ( '.' qualifier )? )? )?
major ::= number
minor ::= number
micro ::= number
qualifier ::= ( alphanum | '_' | '-' )+
version-range ::= interval | atleast
interval ::= ( '[' | '(' ) floor ',' ceiling ( ']' | ')' )
atleast ::= version
floor ::= version
ceiling ::= version
White spaces between terminals are ignored unless specifically noted. The only exception is the directive, a directive must be connected to the colon (‘:’). That is, bndlib stores attributes and directives in the same map and distinguishes them by the fact that directives end in a colon and attributes do not. For example, using a directive like foo; path := http://www.xyz.com
will not work correctly because the path :
is not a valid attribute nor directive name. Any value that contains a space, a tab, a comma, colon, semicolon, equal sign or any other character that is part of a terminal in the grammar must be quoted.
Almost all bndlib instructions follow the general OSGi syntax for a header.
-instruction ::= parameters
parameters ::= clause ( ',' clause ) *
clause ::= path ( ';' path ) *
( ';' parameter ) *
There are a number of short-cuts that are regularly used in bnd commands:
list ::= value ( ',' value ) *
url ::= <URL>
file-path ::= <file path, using / as separator>
Most instructions that accept multiple clauses are merged instructions. A merged instruction is an instruction which is specified by its prefix but which will merge any property that starts with this prefix followed by a period (‘.’). For example, the base instruction -plugin
instruction accepts -plugin.git
as well. The reason for this merging is that allows to append the base instruction with instructions from other files. For example, the bnd file in the cnf/ext
directory are automatically included in the properties of the workspace. Since there can be many files, there would be a need to coordinate the using of a singleton base instruction. As always, singletons suck and in this case we solved it with merged instructions.
Some instructions put, sometimes very subtle, semantics on their ordering. To prevent different results from run to run, we make order the merge properties by placing the base instruction in front and then followed by the values from the other keys in lexically sorted order. For example, -a
, -a.x
, and -a.Z
, and -a.1
will concatenate the clauses in the order -a,-a.1,-a.Z,-a.x
.
Each of the constituents of the merged properties can be empty, this is properly handled by bndlib and will not create a an empty clause as would easily happen when the same thing was done with macros. Empty clauses can help when there is an optionality that depends on some condition.
As an example, lets set the -buildpath
instruction:
-buildpath: com.example.foo;version=1.2
-buildpath.extra: ${if;${debug};com.example.foo.debug\;version=1.2}
This will result in a buildpath of (when debug is not false) of: com.example.foo;version=1.2, com.example.foo.debug;version=1.2
.
If a value in an instruction has a scope then you will be able to use a selector. A scope is a well defined, finite, set. For example, if you use the ‘-exportcontents’ instruction you can specify the packages that can be exported. Since you can only export the packages that are inside your bundle, the scope is the set of package names that were collected so far.
The syntax for a selector is:
selector ::= '!' ?
'=' ?
'*' | ( '.' | '?' | '|' | '$' | . | other ) *
( '.*' ) ?
( ':i' ) ?
A selector is an expression that is matched against all members of the scope. For example, com.example.*
will match any package name that starts with com.example
, for example com.example.foo
, as well as com.example
itself. The syntax used to describe the wildcarding is based on a globbing kind model, however, the selector is translated into a regular expression according to some simple rules.
The replacement rules to build the regular expression are as follows:
*
– Is replaced with .*
, matching any number of characters, triggers wildcard match.?
- Is replaced with .?
, matching any character, triggers wildcard match.|
– Is inserted as is, triggers wildcard match.$
– Escaped to \$
..
– The dot is treated as a package segment separator. It is escaped with \.
. However, if the match ends with .*
, the replacement will be (\..*)?
. This triggers a wildcard match.If you want to go ballistic with regular expressions, then you can go ahead. As long as the wildcards are triggered by one of the defined characters, the replaced string will be used as a regular expression.
If the selector ends with :i
then a case insensitive match is requested, this will ignore any case differences.
A selector can be prefixed with an exclamation mark, this indicates that a match must be treated as a removal operation. To understand removal, it is necessary to realize that the selectors are not declarative, the order of the selectors is relevant.
When bndlib has a scope, it will iterator over each element in the scope and then for element it will iterator over each selector. If there is a match on the scope member then it is always removed from the scope, it will not be matched against later selectors. If the selector starts with an exclamation mark then the member is removed. but it is not treated as a result. Ergo, the exclamation mark allows you to remove members before they are actually matched.
For example, you want to match all implementation packages (i.e. they have impl
in their name) except when they start with com.example
. This can be achieved with:
!com.example.*impl*, *.impl.*
Last, and also least, you can prefix the selector with an equal sign (‘=’) (after the exclamation mark if one is present). This will signal a literal match with the string, no globbing will be done.
The way the period (‘.’) is treated is kind of special. The reason is that people expect that the selector com.example.*
actually includes the package com.example
itself. In bndlib this has always been the case although the OSGi Alliance later decided to not follow this pattern when there was a need for the globbing of packages.
In this example, the first selector will remove any match and ignore it, the second selector now only sees scope members that are fitting in the earlier pattern.
Selectors are used in virtually any place where there is a reasonable scope. It is a very powerful feature. However, it is also easy to go overboard. If you find you need to use this feature excessively then you are likely overdoing it.
One of the most painful thing in making bndlib run anywhere is handling of files. Even though Java has a decent abstraction of the file system, problems with spaces in file names and backslashes in the wrong place have caused an uneven amount of bugs. Ok, this partially because most bndlib developers tend to use Macs, still no excuse and quite embarrassing.
Since bnd files need to be portable across environments we’ve chosen to use the Unix file separator, the slash (or solidus ‘/’) for more reasons than I can reasonably sum up here (what was Bill smoking when he picked the reverse solidus for file separator!). In bndlib, all file paths (ok, should) always go through a single method that actually parses it and turns it into a Java File object. This allows us to support a number of features in a portable way. The syntax of a file path is therefore:
file ::= ( '~/' | '/' )? ( ~['/']+ '/' ) * ~['/'] *
If a file path starts with:
You can use the ‘..’ operator for a segment in the file path to indicate a parent directory. It is highly advised to always use relative addressing to keep your workspace portable. However, there are a number of macros that can be used as anchors:
${p}
, `${project} – The directory of the current project${workspace}
– The directory of the current workspace${build} – The directory of the
cnf` directory.A FILESPEC
defines a set of files relative to a starting point. It can be identical to a FILE for a single file but it has some special syntax to recurse directories. For example, the FILESPEC foo/bar/**/spec/*.ts
finds all files that end in spec/*.ts
somewhere below the foo/bar
directory.
The syntax is as follows:
FILESPEC ::= filespec ( ',' filespec )*
filespec ::= ( segment '/' ) filematch
segment ::= '**' | GLOB
filematch ::= '**' | '**' GLOB | GLOB
If a segment is **
then it will match any directory to any depth. Otherwise it must be a GLOB expression, which can also be a literal. The last segment is the filematch
. This is a GLOB expression on a file name. As a convenience, if it is **
, any file will match in any directory and if it starts with something like **.ts
then it will also recurse and match on *.ts
. That is, the following rules apply to the filematch
:
prefix/** prefix/**/*
prefix/**.ts prefix/**/*.ts
A PATH is a repository specification. It is a specification for a number of JARs and/or bundles from a repository (well, most of the time). A PATH has the following syntax:
PATH ::= target ( ',' target ) *
target ::= ( entry | FILE ';' 'version' '=' 'file' )
( ';' PARAMETERS ) *
entry ::= symbolic-name ( ';' version ) ?
version ::= 'version' '='
( RANGE | 'latest' | 'project')
A PATH defines a number of bundles with its entries. Each entry is either a FILE or a symbolic-name. If only a symbolic name is specified, the default will be the import range from 0.0.0. This selects the whole repository for the given symbolic name. Outside the default, the repository entry can be specified in the following ways:
latest
– Use the project’s output or, if the project does not exists, get the bundle with the highest version from the repositories.project
– Mandate the use of the project’s outputA glob is a pattern that can be matched against a candidate to see if it matches. To match multiple candidates with
one pattern, the pattern is allowed to contain wildcard characters. A wildcard character is a stand in for zero or
more other characters. For example, a glob like *.xml
matches any name that ends with .xml
, the asterisk *
stands
in for zero or more characters. Therefore, it matches foo.xml
, xml.xml.xml
, but also just .xml
. The other
wildchar is the question mark (?
) which matches exactly one character. For example, ???.xml
matches abc.xml
but not a.xml
or .xml
.
Sometimes it is necessary to match against a number of strings. The vertical bar (|
) can separate these strings. For
example, abc|def|ghi
matches any of abc
, def
, or ghi
.
It is also possible to group strings inside the pattern using parentheses. For example, foo(a|b)bar
matches fooabar
or foobbar
.
The glob also supports character classes with the square brackets [
and ]
. Characters between the square brackets
literally match to any character at that position. These character classes also form a group. For example, foo[abc]bar
matches
fooabar
, foobbar
, or foocbar
.
Groups can be repeated using a question mark (cardinality 0..1), the asterisk (cardinality 0..n), or the plus sign (cardinality 1..n). For example, (a)*
matches aaaa
but also the empty string. If the standard cardinalities do not suffice, then a group can be suffixed by a
cardinality specification using the curly braces ({
and }
). Inside the curly braces the lower bound and the optional upper
bound can be specified. If no upper bound is specified then the lower bound is also the upper bound.
For example, (?i)([0-9A-F][0-9A-F]){20}
matches a 40 digit SHA digest in hex regardless of case.
There are some special characters that get special treatment:
special ::= '{' | '}' | '|' | '+' | '*' | '?' | '(' | ')' | '[' | ']'
These characters can be escaped with a reverse solidus (\
) to override their special meaning, they will then be matched literally. To
escape a long string, it is possible to place \Q
at the beginning of a block that needs to be escaped and \E
at the end.
For example, foo\Q*****\Ebar
Some examples:
Glob | Regular Expression | Matches |
---|---|---|
abc.ts |
abc\.ts |
abc.ts |
*.ts |
.*\.ts |
abc.ts, def.ts |
foo***bar |
foo.*.*.*bar |
foobar, fooXbar, fooXXXXbar |
foo(A|B|C)*bar |
foo(A|B|C)*bar |
fooAbar, fooBBBAbar,fooCbar |
foo\{\}bar |
foo\{\}bar |
foo{}bar |
xx(?i)xx |
xx(?i)xx |
xxXx , xxXX , xxxx |
Some instruction use the GLOB but provide some convenience. For example, the package selectors of Export-Package et. al.
detect a case like com.example.*
and turn it into a regular expression that matches com.example.foo
but also matches
com.example
. This is generally explained in the instruction.
This section defines how globs are mapped to regular expressions. Although the globs are quite intuitive to use, they do expose all regular expression capabilities. A strong advice is to keep it simple. The mapping itself is actually non-trivial and takes a number of heuristics that can easily go wrong. Basically, if you need this section you’re likely doing something that is too complex.
For backward compatibility reasons, we support the unix like glob style for or’ing like {a,b,c}
. However, it is
recommended to use the form using the vertical bar since it is more flexible and closer to regular expressions.
A glob is mapped to a regular expression character by character. During this traversal the following states are kept:
SIMPLE
– Basic simple glob form. This is the default.CURLIES
– Inside a curly braces group that is not a cardinality specification. Curly braces can be nested.QUOTED
– Inside a \Q...\E
block (cannot be nested)BRACKETS
– Inside a character group enclosed by square brackets (cannot be nested).Character sets:
escaped ::= `.` | `$` | `^` | `@` | `%`
special ::= `?` | `*` | `+`
end ::= ')' | ']' | '}'
start ::= '('
Escaped characters are escaped in SIMPLE mode and CURLIES mode. In QUOTED or BRACKETS mode no escaping is done. For example
^$
becomes \^\$
matching literally ^$
. However, [^$]
becomes [^$]
, matching anything but a dollar sign. In QUOTED
and BRACKETS mode, also the special
, end
, and start
sets are directly inserted without any special handling.
In the SIMPLE and CURLIES mode the special
characters require special handling. For these remaining mappings it is important
to realize that there are groups. There are parenthesized groups, bracketed group, and curly braces groups. For
example, (a|b)
, [abcd]
, or {a,b,c}
respectively. If curly braces are used after a group then it is a quantifier and not considered a group. That is, {a,b}
is
considered a group but [a,b]{1,2}
then the {1,2}
is a quantifier and not a group.
So when a special
character is preceded with a group, it is inserted without any special processing. If it is not
preceded then the following rules apply:
*
– mapped to .*
, this matches any number of characters?
– mapped to .
, this matches one character+
– mapped to \+
, matches a plus sign!Some examples:
Glob | Regular Expression |
---|---|
* |
.* |
(a)* |
(a)* |
[abc]+ |
[abc]+ |
. |
\. |
[.*+|] |
[.*+|] |
+ |
\+ |
(x|y)+ |
(x|y)+ |
{x,y} |
(?:x|y) |
{[xa],y} |
(?:[xa]|y) |
(?:foo) |
(?:foo) |
{foo} |
(?:foo) |
[\p{Lower}] |
[\p{Lower}] |
[a-z&&[^bc]] |
[a-z&&[^bc]] |