作者: Martin Fowler
出版年: 2010-9
ISBN: 9780132107549
- I.Narratives
- 1. An Introductory Example
- 2.Using Domain-Specific Languages
- 3.Implementing DSLs
- 4.Implementing an Internal DSL
Figure 1.1 State diagram for Miss Grant’s secret compartment
When working in Java, the natural way to do this is through a Domain Model
[Fowler PoEAA] of a state machine.
Figure 1.2 Class diagram of the state machine framework
I keep them as separate classes (with a superclass) as they play different roles in the controller code.
Class AbstractEvent{
private String name, code;
public AbstractEvent(String name, String code) {
this.name = name;
this.code =code;
}
public String getCode() {return code;}
public String getName() {return name;}
}
public Class Command extends AbstractEvent;
public Class Event extends AbstractEvent;
The state class keeps track of the commands that it will send and its outbound transitions.
class State...
private String name;
private List<Command> actions = new ArrayList<Command>();
private Map<String, Transition> transitions = new HashMap<String, Transition>();
class State...
public void addTransition(Event event, State targetState) {
assert null != targetState;
transition.put(event.getCode(), new Transition(this, event, targetState));
}
class Transition...
private final State source, target;
private final Event trigger;
public Transition(State source, Event trigger, State target) {
this.source = source;
this.target = target;
this.trigger = trigger;
}
public State getSource() {return source;}
public State getTarget() {return target;}
public Event getTrigger() {return trigger;}
public String getEventCode() {return trigger.getCode();}
The state machine holds on to its start state.
class StateMachine...
private State start;
public StateMachine(State start) {
This.start = start;
}
Then, any other states in the machine are those reachable from this state.
class StateMachine...
public Collection<State> getStates() {
List<State> result = new ArrayList<State>();
collectStates(result, start);
return result;
}
private void collectStates(Collection<State> result, State s) {
if (result.contains(s)) return;
result.add(s);
for (State next: s.getAllTargets())
collectStates(result, next);
}
class State...
Collection<State> getAllTargets() {
List<State> result = new ArrayList<State>();
for (Transition t : transitions.values())
result.add(t.getTarget());
return result;
}
To handle reset events, I keep a list of them on the state machine.
class StateMachine...
private List<Event> resetEvents = new ArrayList<Event>();
public void addResetEvents(Event... events) {
for (Event e : events) resetEvents.add(e);
}
I don’t need to have a separate structure for reset events like this. I could handle this by simply declaring extra transitions on the state machine like this:
class StateMachine...
private void addResetEvent_byAddingTransitions(Event e) {
for (State s : getStates())
if (!s.hasTransition(e.getCode())) s.addTransition(e, start);
}
The controller has a handle method that takes the event code it receives from the device.
class Controller {
private State currentState;
private StateMachine machine;
private CommandChannel commandChannel;
public CommandChannel getCommandChannel() {
return this.commandChannel;
}
public void handle(String eventCode) {
if (currentState.hasTransition(eventCode)) {
transitionTo(currentState.targetState(eventCode));
} else if (machine.isResetEvent(eventCode)) {
transitionTo(machine.getStart());
// ignore unknown events
}
}
private void transitionTo(State target) {
currentState = target;
currentState.executeActions(commandsChannel);
}
}
class State {
public boolean hasTransition(String eventCode) {
return transitions.containsKey(eventCode);
}
public State targetState(String eventCode) {
return transitions.get(eventCode).getTarget();
}
public void executeActions(CommandChannel commandsChannel) {
for (Command c : actions) commandsChannel.send(c.getCode());
}
}
class StateMachine {
public boolean isResetEvent(String eventCode) {
return resetEventCodes().contains(eventCode);
}
private List<String> resetEventCodes() {
List<String> result = new ArrayList<String>();
for (event e : resetEvents) result.add(e.getCode());
return result;
}
}
Now that I’ve implemented the state machine model, I can program Miss Grant’s controller like this:
Essentially, it is the separation of common code from variable code. We structure the common code in a set of components that we then configure for different purposes.
Figure 1.3 A single library used with multiple configurations
Here is another way of representing that configuration code:
Here’s another version of the configuration code:
This language is a domain-specific language that shares many of the characteristics of DSLs. Now look at this code.
It’s a bit noisier than the custom language earlier, but still pretty clear. Readers whose language likings are similar to mine will probably recognize it as Ruby.
An external DSL
is a domain-specific language represented in a separate language to the main programming language it’s working with. This language may use a custom syntax, or it may follow the syntax of another representation such as XML. An internal DSL
is a DSL represented within the syntax of a general-purpose language. It’s a stylized use of that language for a domain-specific purpose.
You may also hear the term embedded DSL
as a synonym for internal DSL.
Now think again about the original Java configuration code. Is this a DSL? I would argue that it isn’t. Does this mean you can’t do an internal DSL in Java? How about this:
It’s formatted oddly, and uses some unusual programming conventions, but it is valid Java. This I would call a DSL; although it’s more messy than the Ruby DSL, it still has that declarative flow that a DSL needs.
Another term you may come across for an internal DSL is a fluent interface
. This term emphasizes the fact that an internal DSL is really just a particular kind of API, designed with this elusive quality of fluency. Given this distinction, it’s useful to have a name for a nonfluent API—I’ll use the term command-query API
.
If you’re used to using Domain Models [Fowler PoEAA], for the moment you can think of a Semantic Model as very close to the same thing.
Figure 1.4 Parsing a DSL populates a Semantic Model.
One opinion I’ve formed is that the Semantic Model is a vital part of a well-designed DSL. In the wild you’ll find some DSLs use a Semantic Model and some do not, but I’m very much of the opinion that you should almost always use a Semantic Model.
In my discussion so far, I process the DSL to populate the Semantic Model and then execute the Semantic Model to provide the behavior that I want from the controller. This approach is what’s known in language circles as interpretation
. When we interpret some text, we parse it and immediately produce the result that we want from the program.
In the language world, the alternative to interpretation is compilation
. With compilation, we parse some program text and produce an intermediate output, which is then separately processed to provide the behavior we desire. In the context of DSLs, the compilation approach is usually referred to as code generation
.
Figure 1.5 An interpreter parses the text and produces its result in a single process.
Figure 1.6 A compiler parses the text and produces some intermediate code which is then packaged into another process for execution.
Domain-specific language
(noun): a computer programming language of limited expressiveness focused on a particular domain.
I’ll start with internal DSLs.Mike Roberts suggested to me that a command-query API defines the vocabulary of the abstraction, whereas an internal DSL adds a grammar.
A common way of documenting a class with a command-query API is to list all the methods it has. The methods of an internal DSL often only make sense in the context of a larger expression in the DSL.
As a result, an internal DSL should have the feel of putting together whole sentences, rather than a sequence of disconnected commands. This is the basis for calling these kinds of APIs fluent interfaces.
With external DSLs, the boundary is with general-purpose programming languages. Languages can have a domain focus but still be general-purpose languages.
A more obvious DSL is regular expressions. Here, the domain focus (matching text) is coupled with limited features—just enough to make text matching easy. One common indicator of a DSL is that it isn’t Turing-complete
.
The heart of the appeal of a DSL is that it provides a means to more clearly communicate the intent of a part of a system.
The model alone provides a considerable improvement in productivity. It avoids duplication by gathering together common code; above all, it provides an abstraction to think about the problem that makes it easier to specify what’s going on in an understandable way. A DSL enhances this by providing a more expressive form to read and manipulate that abstraction. A DSL can help people learn how to use an API since it shifts focus to how different API methods should be combined together.
When people talk about DSLs in this context, it’s often along the lines of “Now we can get rid of programmers and have business people specify the rules themselves.” I call this argument the COBOL fallacy
—since that was the expectation with COBOL.
Despite the COBOL fallacy, I do think DSLs can improve communication. It’s not that domain experts will write the DSLs themselves; but they can read them and thus understand what the system thinks it’s doing. By being able to read DSL code, domain experts can spot mistakes. They can also talk more effectively to the programmers who do write the rules, perhaps by writing some rough drafts that can be refined into proper DSL rules.
Using a DSL like this can often make up for limitations in a host language, allowing us to express things in a comfortable DSL and then generate code for the actual execution environment to use.
A model can facilitate this kind of shift. Once you have a model, it’s easy to either execute it directly or generate code from it. Models can be also be populated from a forms-style interface as well as a DSL. A DSL has a couple of advantages over using forms. DSLs are often better than forms at representing complicated logic.
Mainstream programming is pretty much all done using an imperative
model of computation. Imperative computation has become popular because it’s relatively easy to understand and easy to apply to lots of problems. However, it isn’t always the best choice.
You often hear such nonimperative
approaches referred to as declarative programming
. The notion is that these styles allow you to declare what
should happen, rather than work through the imperative statements that describe how
the behavior works.
The most common objection I hear to DSLs is what I call the language cacophony problem
: the concern that languages are hard to learn, so using many languages will be much more complicated than using a single one.
Even if a DSL might help, sometimes it would just be too much effort to build and maintain for the marginal benefit.
The ghetto language problem is a contrast to the language cacophony problem. Here, we have a company that’s built a lot of its systems on an in-house language which is not used anywhere else. This makes it difficult for them to find new staff and to keep up with technological changes.
The usefulness of a DSL is that it provides an abstraction that you can use to think about a subject area. Such an abstraction is really valuable; it allows you to express the behavior of a domain much more easily than if you think in terms of lower-level constructs.
With a blinkered abstraction, you spend more effort on fitting the world into your abstraction than the other way around. You see this when you come across something that doesn’t fit in with the abstraction—and you burn time trying to make it fit, instead of changing the abstraction to easily absorb the new behavior.
Figure 3.1 The overall architecture of DSL processing that I usually prefer
So the differences between internal and external DSLs lie entirely in parsing, and indeed there are many differences in detail between the two.
In the external syntax, it looked something like this:
events
doorClosed D1CL
drawerOpened D2OP
end
We can take a similar view in the Ruby internal DSL.
event :doorClosed "D1CL"
event :drawerOpened "D2OP"
Whenever you look at a script like this, you can imagine that script as a hierarchy; such a hierarchy is called a syntax tree
(or parse tree).
Figure 3.2 A syntax tree and a semantic model are usually different representations of a DSL script.
A grammar
is a set of rules which describe how a stream of text is turned into a syntax tree
.
Let’s take a fragment of a state machine example:
commands
unlockDoor D1CL
end
state idle
actions {unlockDoor}
end
Here we see a common situation: A command is defined in one part of the language and referred to somewhere else. When the command is referred to as part of the state’s actions, we’re on a different branch of the syntax tree from where the command was defined. If the only representation of the syntax tree is on the call stack, then the command definition has disappeared by now. As a result, we need to store the command object for later use so we can resolve the reference in the action clause.
In order to do this, we use a Symbol Table
, which is essentially a dictionary whose key is the identifier unlockDoor and whose value is an object that represents the command in our parse. When we process the text unlockDoor D1UL, we create an object to hold that data and stash it in the Symbol Table under the key unlockDoor. The object we stash may be the semantic model object for a command, or it could be an intermediate object that’s local to the syntax tree. Later, when we process actions {unlockDoor}, we look up that object using the Symbol Table to capture the relationship between the state and its actions. A Symbol Table is thus a crucial tool for making the cross-references.
Figure 3.3 Parsing creates both a parse tree and a symbol table.
For many people, the central pattern of a fluent interface is that of Method Chaining
. A normal API might have code like this:
Processor p = new Processor(2, 2500, Processor.Type.i386);
Disk d1 = new Disk(150, Disk.UNKNOWN_SPEED, null);
Disk d2 = new Disk(75, 7200, Disk.Interface.SATA);
return new Computer(p, d1, d2);
With Method Chaining
, we can express the same thing with:
computer()
.processor()
.core(2)
.speed(2500)
.i386()
.disk()
.size(150)
.disk()
.size(75)
.speed(7200)
.sata()
.end()
Here is the same thing using a sequence of method call statements, which I call a Function Sequence
:
computer();
processor();
core(2);
speed(2500);
i386();
disk();
size(150);
disk();
size(75);
speed(7200);
sata();
The fact that a fluent interface is a different kind of interface to a command-query one can lead to complications. If you mix both styles of interface on the same class, it’s confusing. I therefore advocate keeping the language-handling elements of a DSL separate from regular command-query objects by building a layer of Expression Builders
over regular objects. Expression Builders are objects whose sole task is to build up a model of normal objects using a fluent interface—effectively translating fluent sentences into a sequence of command-query API calls.
There are a number of patterns for combining functions to make a DSL. First, Method Chaining:
computer()
.processor()
.core(2)
.speed(2500)
.i386()
.disk()
.size(150)
.disk()
.size(75)
.speed(7200)
.sata()
.end()
Then, Function Sequence
:
computer();
processor();
core(2);
speed(2500);
i386();
disk();
size(150);
disk();
size(75);
speed(7200);
sata();
Both Function Sequence and Method Chaining require you to use Context Variables
in order to keep track of the parse. Nested Function is a third function combination technique that can often avoid Context Variables. Using Nested Function
, the computer configuration example looks like this:
computer(
.processor(
.core(2),
.speed(2500),
.i386
),
.disk(
.size(150)
),
.disk(
.size(75),
.speed(7200),
.
)
);
- A
Function Sequence
works well for defining each element of a list. It keeps each computer definition well separated into statements. - The
Nested Function
for each computer eliminates the need for a Context Variable for the current computer, as the arguments are all evaluated before the computer function is called. In general, Nested Function makes it safer to use global functions, as it’s easier to arrange things so the global function just returns an object and doesn’t alter any parsing state. - If each processor and disk have multiple optional arguments, then that works well with
Method Chaining
.