html5 backend: everything

At the Mozilla/Berkeley meeting on 2/10, an important concern was how to use our FTL synthesizer for difficult features of CSS such as tables, floats, and text. As promised, this document shows how we generated code for the core of automatic table layoutHTML4 Tables. We were able to automatically generate parallel code and, by the same reasoning, see no obstacle for generating incremental code. We did have to slightly modify our runtime library and have been planning a language extension to automate this modification.

I picked two features from the CSS standard to implement. On the left, you can see them computed live by a JavaScript engine we generated, and on the right, by the browser's native layout engine:

In total, we only edited the specification (see above) and the runtime. Our runtime edits were to use a breadth-first traversal for traversing a table and, to lookup the children of a column, search table rows for cells with the corresponding column number attribute. We did not have to add table-specific code into the synthesizer (the offline scheduling analysis) nor the code generator. Furthermore, we are extending the specification language to better handle the patterns we encountered, which would simplify the specification and eliminate some if not all of the runtime modifications.

The rest of this document overviews our HTML5 implementation, one of our ideas for an FTL extension for an even cleaner specification, and shows our code.

Solution: computing the table grid with list function calls

Unsurprisingly, our specification traverses the cells in top-down, left-to-right order. For each row, it computes what columns its cells are placed in as a function of the list of columns that are still occuppied by preceding cells. The next row is given the columns that are occupied after adding cells on the current row, etc. Our specification of this behavior is interesting in that it is just calls to functional list manipulation methods written in our host language (C++/JavaScript):

//Specification in an idealized syntax
class TableBox {
  rows[-1].colAssignment := emptyColumnList(colCount);
  rows[i >= 0].colAssignment := 
    columnsAppendRow(
      rows[i - 1].colAssignment, 
      rows[i].cells, 
      rows[i].rowNum);

//Equivalent specification in FTL's current surface syntax
class TableBox (shrinkToFitHeightWidth, strokeBox) : Node {
  loop rows {
    rows.colAssignment := 
      fold 
        emptyColumnList(colCount) 
        .. 
        columnsAppendRow(
          rows$-.colAssignment, 
          rows$i.cells, 
          rows$i.rowNum);

The specification on the left uses a slightly cleaner syntax than our FTL prototype (shown on the right), but the important part is the calls to functions emptyColumnList() and columnsAppendRow(). As long as emptyColumnList() and columnsAppendRow() implement a functionally pure interface, attribute grammar techniques for automatic parallelization and incrementalization still apply. For example, instead of a destructive append that mutates a list, we use a functional append. If a script adds a cell to row 3, incremental evaluation could therefore safely reuse rows[2].colAssignment to recompute rows[3].colAssignment without first recomputing rows[-1,0,1].colAssignment.

Solution: computing over a dynamically generated graph

The grid is stored in an attribute, so we simply propagated the grid to all the table nodes as an attribute (cellsready). We were then able to declare the dependency on cellsready to the column computations:

The synthesizer now knows to schedule relX and absX computations only after the grid cellsready is computed.

Solution: computing over a DAG

A modest proposal

This specification provides two important pieces of information. First, the synthesizer now knows that any computation over a column's cells depends on first being able to compute the result of the selector, which in turn requires having already computed cell attribute column and column attribute colNum. Second, the localized DAG structure of tables is exposed, so the code generator now knows to use a breadth-first traversal to topologically evaluate any table/rows/columns/cells region of a document.

Demo: mixing tables with other elements

The following (live) rendering uses several types of nodes, such as line wrapping boxes. Border color denotes node type:

Blue: <WrapBox>
Black: <Leaf>
Gray: <Cell>
Orange: <Cell rowspan="2">
Pink: <Cell colspan="2">
Green: <Row>
Red: <Column>
Light gray: <TableBox>
Light green: <HBox>
Purple: <VBox>

Most nodes are by default shrink-to-fit and all can have their size overriden by an XML attribute:

Document source

The document is similar to standard HTML. Most attributes, such as the x and y position, are solved by the layout engine. We use CSS selectors to set some basic attributes:

Other attributes are set as attributes directly in the XML:

asdf

Widget sources

The widget is mostly specified in FTL. Note the (optional) use of scheduling constraints at the top and, to allow foreign functions to compute some values, phantom attribute sections for rows and columns.

Monkey patch

After the layout engine is loaded at runtime, we run the following monkey patches:

The second time a table is visited, a monkey patched call to dynamically create implicit column cells.
For when a column is visited and its cells are examined, swap the local getChildren function to instead find cells that are children of rows and have the right column number.
Modify the global depth first visit order within tables to use breadth-first for top down visits in order to guarantee all rows and columns are visited before the cells. The reverse is used for bottom up traversals.

Table ADT

Basic list/array manipulation functions for the grid. They use mutation etc. internally, but implement a functional (non-destructive) interface.

Generated layout engine

The code here is automatically generated from the specification above; we did not modify it at all.

It is fairly naive for the HTML5 backend. Of note:

The top section is code to compute individual attribute values. The host language's compiler should inline these.
Demonstrating the many uses of code generation, we also emit logging code to explain how a layout is computed. You can see a logging version of this demo by changing line 91 of the HTML (change if (false) { ... to true) and examing the output in the Firebug console.
The empty code blocks correspond to phantom attributes described above.
The bottom section is visitor dispatch code. Inherit means a top down traversal and synthesize means bottom up.

span inside a cell, such that the cell spans two columns of the table		inline-block inside a cell, such that the cell spans two rows of the table
span inside a singleton cell	span inside singleton cell

Synthesizing an automatic table layout solver with FTL

Contents