A demonstration of how to build and publish pages with the baker build tool
A demonstration of how to build and publish pages with the baker build tool.
The Los Angeles Times uses baker to create the static pages published at latimes.com/projects. The Times system relies on a private version of a repository much like this one. This simplified example publishes staging and production versions to public buckets on Amazon S3.
push
event via GitHub Actionrelease
event via GitHub ActionWith a little configuration, you can use this template to easily publish a page. With a little customization, you can make it look any way youâd like. This guide will introduce you to the basics.
The first step is to click GitHubâs âuse this templateâ button to a make a copy of the repository for yourself.
Youâll be asked to provide a slug for your project. Once thatâs done, a new repository will be available at https://github.com/your-username/your-slug
.
Next youâll need to clone it down to your computer to work with the code.
Open up your terminal and cd to your code folder. Clone the project into your folder. This will copy the project onto your computer.
git clone https://github.com/your-username/your-slug
If that command doesnât work for you, it could be because your computer was set up differently and you need to clone to the repository using SSH. Try running this in your terminal:
git clone [email protected]:your-username/your-slug.git
Once the repository has finished downloading, cd into your-slug and install the Node.js dependencies.
npm install
Once the dependencies have been installed, youâre ready to preview the project. Run the following to start the test server.
npm start
Now go to localhost:3000
in your browser. You should see a boilerplate page ready for your customizations.
An alternative route is to create a new page with bluprint, the command-line scaffolding tool developed by the Reuters graphics department.
bluprint add https://github.com/datadesk/baker-example-page-template
mkdir my-new-page
cd my-new-page
bluprint start baker-example-page
Here are the standard files and folders that youâll find when you clone a new project from our page template. Youâll spend more time working with some files than others, but itâs good to have a general sense of what they all do.
The data folder contains relevant data for the project. We use this folder to store required information about every project â like what URL it should live at. You can also store a variety of other data types here, including .aml
, .csv
, and .json
.
The meta.aml
file contains important information about the page such as the headline, byline, slug, publication date and other fields. Filling it out ensures that your page displays correctly and can be indexed by search engines. A full list of all the attributes can be found in our reference materials. You can expand this to include as many, or as little, options as youâd like.
This folder that stores our siteâs base template and reusable code snippets. When youâre starting out, youâre unlikely to change anything here. In more advanced use cases, itâs where you can store code that is reused across multiple pages.
base.html
The base.html file contains all the fundamental HTML found on every page we create. The example here is rudimentary by design. Youâd likely want include a lot more in a real-world implementation.
The workspace is a place for you to put anything relevant to the project that doesnât need to be published on the web. AI files, bits of code, writing, etc. Itâs up to you.
This is used to store media and other assets such as images, videos, audio, fonts, etc. They can be pulled into the page via the static
template tags.
JavaScript files are stored in this folder. The main file for JavaScript is called app.js
, which you can write your code directly. Packages installed via npm
can be imported and run like any other Node.js script. You can create other files to write your JavaScript code in this folder, but you must make sure that the file is booted from app.js
.
Our stylesheets are written in SASS, a powerful stylesheet language that gives developers more control over CSS. If youâre not comfortable with SASS, you can write plain CSS into the stylesheets.
The styles folder consists of a stylesheet (app.scss
) where you can add all of your styles custom to your project, though sometimes you might want to make additional stylesheets and import them into app.scss
. This example project only include the bare minimum necessary simulate a simple site. Youâd likely want to start off with a lot more in a real world implementation.
The baker.config.js
file is where we put options that Baker uses to serve and build the project. It has been fully documented elsewhere in this file. With the exception of the domain
setting, only advanced users will need to change this file.
The default template for your page. This is where you will lay out your page. It uses the Nujucks templating system to create HTML.
These files track the Node dependencies used in our projects. When you run npm install
the libraries you add will be automatically tracked here for you.
This is a special directory for storing files that GitHub uses to interact with our projects and code. The .github/workflows
directory contains the GitHub Action that handles our development deployments. You do not need to edit anything in here.
Files stores in the assets directory are optimized and hashed as part of the deployment process. To ensure that your references to images and other static files, you should use the {% static %}
tag. That ensures the file is heavily cached when itâs published and that the link to the image works across all environments. Youâll want to use it for all photos and videos.
<figure>
<img src="{% static 'assets/images/baker.jpg' %}" alt="Baker logo" width=200>
</figure>
Structrued data files stored in your _data
folder are accessible via templatetags or JavaScript. In this demonstration, a file called example.json
has been included to illustrate whatâs possible. Other file formats like CSV, YAML and AML are supported.
Files in the _data
folder are available by their name within your templates. So, with _data/example.json
, you can write something like:
{% for obj in example %}
{{ obj.year }}: {{ obj.wheat }}
{% endfor %}
A common need for anyone building a project in Baker is access to raw data within a JavaScript file. Often this data is then passed along to code written using d3 or Svelte to draw graphics or create HTML tables on the page.
If the data youâre accessing is already available at a URL you trust will stay live, this is easy. But what if it isnât, and it is data youâve prepared yourself?
Itâs possible to access records in your _data folder. The only caveat is the job of converting this file into a usable state is your responsibility. A good library for this is d3-fetch
.
To build the URL to this file in a way Baker understands, use this format:
import { json } from 'd3-fetch';
// the first parameter should be the path to the file
// the second parameter *must* be âimport.meta.urlâ
const url = new URL(â../_data/example.jsonâ, import.meta.url);
// Call it in
const data = await json(url);
Another approach is to print the data into your template as a script
tag. The jsonScript
filter takes the variable passed to it, runs JSON.stringify
on it, and outputs the JSON into the HTML within a <script>
tag with the ID set on it you pass as the parameter.
{{ example|jsonScript('example-data') }}
Once that is in place, you can now retrieve the JSON stored in the page by ID in your JavaScript.
// grab the element jsonScript created by using the same ID you passed in
const dataElement = document.getElementById('example-data');
// convert the contents of that element into JSON
// do what you need to do with âdataâ!
const data = JSON.parse(dataElement.textContent);
While the URL method is recommended, this method may still be preferred when you are trying to avoid extra network requests. It also has the added benefit of not requiring a special library to convert .csv
data into JSON.
You can generate an unlimited number of static pages by feeding a structured data source to the createPages
option in the baker.config.js
file. For instance, this snippet will generate a page for every record in the example.json
file.
export default {
// ... all the other options above this one have been excluded to make the point
createPages: createPages(createPage, data) {
// Grab the data from the _data folder
const pageList = data.example;
// Loop through the records
for (const d of pageList) {
// Set the base template that will be used for each object. It's in the _layouts folder
const template = 'year-detail.html';
// Set the URL for the page
const url = `${d.year}`;
// Set the variables that will be passed into the template's context
const context = { obj: d };
// Use the provided function to render the page
createPage(template, url, context);
}
},
};
That could be used to create URLs like /baker-example-page-template/1775/
and /baker-example-page-template/1780/]
with a single template.
Before you can deploy a page created by this repository, you will need to configure your Amazon AWS account and add a set of credentials to your GitHub account.
First, youâll need to create two buckets in Amazonâs S3 storage service. One is for your staging site. The other is for your production site. For this simple example, each should allow public access and be configured to serve a static website. In a more sophisticated arragenment, like the one we run at the Los Angeles Times, the buckets could be linked with registered domain names and the staging site shielded from public view via an authentication scheme.
The names of those buckets should then be stored as GitHub âsecretsâ accessible to the Actions that deploy the site. You should visit your settings panel for your account or organization. Start by adding these two secrets.
Name | Value |
---|---|
BAKER_AWS_S3_STAGING_BUCKET |
The name of your staging bucket |
BAKER_AWS_S3_STAGING_REGION |
The S3 region where your staging bucket was created |
BAKER_AWS_S3_PRODUCTION_BUCKET |
The name of your production bucket |
BAKER_AWS_S3_PRODUCTION_REGION |
The S3 region where your production bucket was created |
Next you should ensure that you have an key pair from AWS that has the ability to upload public files to your two buckets. The values should also be added to your secrets.
Name | Value |
---|---|
BAKER_AWS_ACCESS_KEY_ID |
The AWS access key |
BAKER_AWS_SECRET_ACCESS_KEY |
The AWS secret key |
A GitHub Action included with this repository will automatically publish a staging version for every branch. For instance, code pushed to the default main
branch will appear at https://your-staging-bucket-url/your-repo/main/
.
If you were to create a new git branch called bugfix
and push your code, you would soon see a new staging version at https://your-staging-bucket-url/your-repo/bugfix/
.
Before you send your page live, you should settle on a final slug for the URL. This will set the subdirectory in your bucket where the page will be published. This feature allows The Times to publish numerous pages inside the same bucket with each page managed by a different repository.
Step one is to enter the slug for your URL into the _data/meta.aml
configuration file.
slug: your-page-slug
Itâs never a bad idea to make sure your slug hasnât already been taken. You can do that by visiting https://your-production-bucket-url/your-slug/
and ensuring it returns a page not found error.
If you want to publish your page at the root of your bucket, you can leave the slug null.
slug:
Next you commit your change to the configuration file and make sure itâs pushed to the main branch on GitHub.
git add _data/meta.aml
git commit -m âSet page slugâ
git push origin main
Visit the releases section of your repositoryâs page on GitHub. You can find it on the repoâs homepage.
Draft a new release.
There youâll create a new tag number. A good approach is to start with an x.x.x format number that follows semantic versioning standards. 1.0.0 is a fine start.
Finally, hit the big green button at the bottom and send out the release.
Wait a few minutes and your page should show up at https://your-production-bucket-url/your-slug/
.
The baker test server can log with greater detail by starting with the following option.
DEBUG=1 npm start
To limit the logs to baker run:
DEBUG=baker:* npm start
If your build is unsuccessful, you can try creating the static site yourself locally via your terminal. If there are errors with the page building, they will be printed out to your terminal.
npm run build
Baker comes with a set of global variables that are the same on every page it creates, and another set of page-specific variables that are set based on the current page being created. You can use these variables to conditionally add content to pages or filter out unrelated data based on the current page.
NODE_ENV
The NODE_ENV
variable will always be one of two values: development
or production
. It corresponds to what type of build is being run on the page.
When you run npm start
, you are in development
mode. When you run npm run build
, you are in production
mode.
This is most useful for adding things to pages only when youâre in development
mode.
{% if NODE_ENV == âdevelopmentâ %}
<p>Youâll never see this on the live site!</p>
{% endif %}
DOMAIN
The DOMAIN
variable will always be the same as the domain
option passed in baker.config.js
, or an empty string if one was not passed.
PATH_PREFIX
The PATH_PREFIX
variable will always be the same as the pathPrefix
option passed in baker.config.js
, or a single forward slash (/
) if one was not passed.
page.url
The project-relative URL to the current page. Will include the pathPrefix
if one was provided in the baker.config.js
file â in other words, it will account for any project pathing being done and point at the correct page in the project.
page.absoluteUrl
The absolute URL to the current page. This combines the domain
, pathPrefix
and current path into a full URL. This is currently used to output the canonical URL and all URLs for social <meta>
tags.
<link rel="canonical" href="{{ page.absoluteUrl }}">
page.inputUrl
This is the path to the original template used to create this page relative to the current projectâs directory. If you have an HTML file located at page-two/index.html
, page.inputUrl
would be page-two/index.html
.
page.outputUrl
This is the path to the HTML file that was output to create this page relative to the _dist
folder. If you have an HTML file located at page-two.html
, page.outputUrl
would be page-two/index.html
.
Every Baker project we work on includes a baker.config.js
file in the root directory. This file is responsible for passing information to Baker so it can correctly build your project.
export default {
// the directory where assets are
assets: âassetsâ,
// createPages
createPages: undefined,
// the data directory
data: â_dataâ,
// an optional custom domain to be used in building paths
domain: undefined,
// a path or glob of paths of each JavaScript entrypoint
entrypoints: âscripts/app.jsâ,
// the overall input directory, typically the current folder
input: process.cwd(),
// where the template layouts, macros and includes are located
layouts: â_layoutsâ,
// an object with the keys and values of global variables to be
// passed to all Nunjucks templates
nunjucksVariables: undefined,
// an object of key (name) + value (function) for adding custom
// filters to Nunjucks
nunjucksFilters: undefined,
// an object of key (name) + value (function) for adding custom
// tags to Nunjucks
nunjucksTags: undefined,
// where to output the compiled files
output: â_distâ,
// a prefix to add to the beginning of every resolved path, how
// slugs work
pathPrefix: â/â,
// an optional directory to put all assets within, rarely used
staticRoot: ââ,
};
default: âassetsâ
This tells Baker which folder to treat as the assets directory. You likely do not have to change this.
default: undefined
createPages
is an optional parameter that makes it possible to dynamically create pages using data and templates in the project.
export default {
// âŚ
// createPage - pass in a template, an output name, and the data context
// data - the prepared data in the `_data` folder
createPages(createPage, data) {
for (const title of data.titles) {
createPage('template.html', `${title}.html`, {
context: { title },
});
}
},
};
default: â_dataâ
The data
option tells Baker which folder to treat as its data source. You likely will not need to change this.
default: undefined
The domain
option tells Baker what to use when it builds absolute URLs. The bakery-template
presets this to https://www.latimes.com
.
default: âscripts/app.jsâ
The entrypoints
option tells Baker what JavaScript files to treat as starting points for script bundles. This can be a path to a file or a file glob, making it possible to create multiple bundles at the same time.
default: process.cwd()
The input
option tells Baker what folder to treat as the main directory for the entire project. By default this is the folder the baker.config.js
file is in. You likely will not need to set this.
default: â_layoutsâ
The layouts
option tells Baker where the templates, includes and macros are located. By default this is the _layouts
folder. You likely will not need to set this.
default: undefined
You can use nunjucksFilters
to pass in your own custom filters. In the object each key is the name of the filter, and the function value is what is called when you use the filter.
export default {
// ...
// pass an object of filters to add to Nunjucks
nunjucksFilters: {
square(n) {
n = +n;
return n * n;
}
},
}
{{ value|square }}
default: undefined
You can use nunjucksTags
to pass in your own custom tags. These differ from filters in that they make it easier to output blocks of text or HTML.
export default {
// ...
// pass an object of filters to add to Nunjucks
nunjucksTags: {
doubler(n) {
return `<p>${n} doubled is ${n * 2}</p>`;
}
},
};
{% doubler value %}
default: â_distâ
The output
option tells Baker where to put files when npm run build
is run. By default this is the _dist
folder. You likely will not need to set this.
default: â/â
pathPrefix
tells Baker what path prefix to add to any URLs it builds. If domain
is also passed, it will be combined with pathPrefix
when building absolute URLs. You typically will not set this manually â it is used during deploys for building URLs with project slugs.
default: ââ
The staticRoot
option instructs Baker to put all assets in an additional directory. This is useful for projects that need to have unique slugs across every single page without nesting, allowing them to all share static assets. However â this is a special case and requires a custom setup for deployments. Do not attempt to use this without a good reason.