Knowledge Sharing

3.15.2024

Build PDF Documents with React on the Server

Generating PDF documents or reports is a quite common task in software development. There are many use cases for that: hospital reports, company time logs, invoices, you name it. And, trust me, every developer is facing the same set of issues for the task.

And, trust me, every developer is facing the same set of issues:

What approach to use for PDF generation?
How to manage the page breaks?
Is it possible to have the final document preview immediately?
Do we need to have a Java/C#/Python or any other language-based server or can we do it with JavaScript?
Should we consider some third-party solution and whether it fulfills all our requirements?
Maybe it should be on the Front End (client) side instead of the Back End?

Those, and many other questions are the common thing when it comes to PDF generation. The goals vary, so do the answers. As a React development company, we have a single goal – create a generic approach for building the PDF document using React and JavaScript (well, TypeScript to be specific) on the server with the instant preview on the UI.

We'll speak about the approach we used successfully on one of our projects – healthcare web service for extensive EEG diagnosing. The project required the creation of a unified way of describing the state of the patient in neurology, data filled by doctors and clinicians had to be converted into a dynamic PDF report of the patient’s study. So in this article, we'll walk through a full cycle of PDF generation, from the basic steps to the final result of our approach.

Rolique Healthcare Software Engineering Services — Case: HOLBERG

I hope this would be a journey you’d enjoy!‍‍

Questions need to be answered first

Why should it be on the server if we can print it from the UI in the browser?
Rendering from the client’s machine, in the client’s browser is challenging. There would be no consistency between the two users because the final result would depend on many factors like screen resolution, availability of the fonts, whether it’s a retina or not, also there could be differences between the browsers which users are potentially using. This would work for a very low level of requirements or, as a first version, since it is quite easy to implement.

Should I use some other programming language on the server or third-party service?
‍The third-party services would work fine for generic PDFs, maybe with quite some level of flexibility, but most likely never as flexible as a custom solution. This is a tradeoff for using ease. Also, using other languages restricts us from the ability to reuse the React components for both PDF reports and instant preview. Later, we will talk about this more definitely, but for now — we can assume, that adding another language is increasing the complexity of the codebase and requires either us to learn/know that language or to hire another developer specifically for this.

Is there a way to use React as a content builder?
Yes, absolutely! React has the ability to render on the server-side into an HTML string, which we can easily convert into a final PDF afterward.Hence, if we are using React and by having a dedicated set of components for the documents we can have an instant preview of the final document and also reuse those components on the server to create a PDF version. Don’t you think this is really neat, huh?

Where to start?

Brief:‍

Our goal is to acquire a tool that can easily create PDF files reusing the same components that we use on the UI application.
Shortly, we will create 3 projects: the UI application, Shared component, and PDF Report service. Idea is to have defined building blocks of the possible PDF, separate the data layer, and have the instant preview on the UI application.
Now, let’s get deep into the details.

Step one: Project structure

It is important to have the components extracted to a separate codebase because we would need to import them into both Web UI and Report service. The way you have to implement this does not really matter. It could be a private package for shared components or monorepo, or any other approach. The main goal is to be able to access the same shared codebase on both ends, UI and Report Service. Here is a simple preview of that structure.

codebase dependencies structure Rolique Web UI — Codebase dependencies structure

In my example, I used a basic Lerna monorepo setup with Yarn workspaces, the final code repo will be linked in the end. There are 3 packages I have, as on the diagram above, so the ^component the package is being imported to the Web UI and Report service.

Step two: Setup basic Web UI layout

For demonstration purposes, we can have a really simple layout on the screen. Let’s divide it into two halves, where the left one is a data source and the right one is the instant PDF document preview. Right now we will just have a basic working inputs form with minimal validation. Actually, the form is a copy-paste example from the ^{react-hook-form}, which I’m using here as well.

The other half is the empty preview container which we will update later. For now, it is just a placeholder with the dimensions of the A4 document format.

Step three: Basic report components setup

During this step, we will set up the shared components package. I used Create React Library, again, for simplicity. The setup does not matter in our case, our only
need — is to have a final build file with the components to reuse on the client and the server sides.

What is actually important within this step is to define what kind of components we are willing to have available for the PDF documents. So, now we can freely transition from the concept of the Component to a Block.

Then what is the difference between a Component and a Block, you might ask? Technically, the block is a recursive serializable data structure. It describes what kind of UI element to use, where to pull the data for it, and if there are any specific attributes available for it. Any kind of UI element that we want to have on the PDF document — we should describe it as a block, first. In other words — there is an ^BaseBlock abstract class and plenty of concrete classes for every generic element. Think of the blocks as a description of what we want to see, like a ^Title block or ^Image block, maybe also a block for the list of items.

Every block has a list of required properties: ^Identifierand ^Component. Additionally, we have a ^accessor, ^metadata and items.

With ^Identifier enum we can deserialize the block from JSON to the correct concrete class implementation, where there would be a ^Component property, function of an actual UI for this block. With _accessorwe can get the value for this specific instance of the block (think of the Title block, there could be many different titles, but the block is one, so we only change the accessor). Blocks are recursive, so every one of them (almost) has a ^itemsproperty, a list of children blocks. Last but not least, is the ^metadataproperty, we need it to describe some of the specific details of the block instance. For example, a Title block can have different styling like bold, italic, etc. So, this is what the metadata is.

By leveraging the power of blocks we can build a Schema for the type of document we want. We can define several schemas, each one is built from many blocks. One schema could describe a document structure for the employee Time Logs, another schema describes the structure for an Invoice. Both are made with the same blocks but differently structured.

By having the serializable Block data structure we are obtaining several benefits:

The layout of the report can be constructed dynamically.
We can consume schema not only in our service but, maybe, on the other service built with different programming languages, in case, we will need to support that in prospect.
Schema can be stored in the database as a simple JSON and transferred over the network easily.
We create a restricted set of supported blocks, so no magic here. It also helps to avoid vague product requirements.
Scoped implementation, preventing chaotic code structures that are unreadable.

Another thing left, is to render the schema in React. And guess what? It is so simple because every block from the rendered perspective is the same, it does not bother with the actual block implementation, it just renders.

Since the structure is recursive we render it in a recursive way. Key optimization is important only on the same nesting level, so for simplicity, we use an identifier with an index. But you can have some ID instead, or actual value.

And, finally, the actual schema is just a multi-level JavaScript object of block instances. We will feed it into the ^RenderBlock component and all the magic is done there. Neat!

Step three: Data provider

It is a huge deal to separate the Schema from the actual data, so they can be dynamic. Who would need a layout that is tightly coupled to the data, right? That is why every block has a accessor property. By the accessor, we can literally access the data needed for this specific instance of the block. Let’s say, we can have several titles, so the possibles accessors are ^firstName, ^lastName, etc.

The React way of doing this is simple — the Context API. Our data object would be mostly a flat object, so we avoid complex accessors (even though it is possible, but harder to maintain). The provider component would wrap the RenderBlock component. This way the data is provided by the DataProvider and is easily accessible via the hooks inside of the block’s implementation, and the layout (Schema) is provided separately. Well, you should already get the idea and the benefit.

Another great thing, we were able to achieve, is that TypeScript won’t let us push unknown accessor to the block instance. We defined our Data object with the list of available accessors and their type. Let’s try to use an unknown accessor and see what happens.

Thanks, TypeScript! This is very useful to prevent developer mistakes. Of course, this is only a compile-time error, but it helps anyway. If required — we can have a runtime check with a bit more work there, but now, that is not our goal.

Step four: Set up the server-side

We can go with either a straightforward (express) or structured (Nest.js) approach. I prefer using Nest.js for the back-end services because it has a clean way of doing things out of the box. Not only that, this is a framework, so I don’t bother with additional setup steps.

What we actually need for the simple use case — is a single endpoint. To that endpoint, we are going to push the body with Schema and Data. And then, the rest of the work is moved to the service.

The strategy of getting an actual PDF result is to follow several simple steps:

Accept the Schema and Data through the REST (no necessarily) API
Parse the body and validate it. (Check for unknown identifiers, etc)
Deserialize Schema JSON into actual block instances
Feed the Schema to the ^RenderBlock component and Data to^DataProvidercomponent
Render everything to the HTML string (or stream). Optionally include the necessary CSS code
Sanitize the HTML result to cover the security risks
Using Puppeteer render HTML in the headless Chrome and extract PDF file with the supported API
Push PDF file as a stream to the client in the HTTP response

These are basic steps to create a PDF this way. Of course, we can have some extra steps in-between, but it’s another story. The main idea is to create a final HTML and convert it into a PDF using Puppeteer.

If you are not familiar with Puppeteer — it is an API abstraction layer over the headless Chrome or Chromium browser.

What is also important in our case, is to run the service inside of the Docker container. This way we can have a consistent resulting PDF document that does not rely on the type of server it runs on.

Validation of the body is quite simple for use. With the data structure that we have here, it is important to only check if the block identifier from the Schema is the one that we know, otherwise, it is an error in the payload. We would throw an exception if the body is invalid. In Nest.js this is done via validation pipe using ^{class-validator}library.

The other part is deserialization. At this point, we can be sure that the body is valid, so we can simply deserialize that plain object into the actual block. We have already prepared for this by providing the function in the component library for deserialization. It makes sense to keep that logic in the components project, this way it is incapsulated and the service does not need to know too much about the data. Again, in Nest.js we are using ^{class-transformer} to convert every block into a plain block (it only has an instance, but does not implement any UI). Plain blocks are useful because we can run validation on them. After, we convert them into the correct blocks using ^{deserializeSchema} there would be no problem feeding that into the ^RenderBlock component.

Ok, great, we have the valid and transformed data at this point, so it can be pushed into a render step.

React library lets us use the ^{react-dom/server} package, with which we are able to render the UI, but specifically on the server. It could be either string or stream, but we will go with HTML string for simplicity. So, the structure is clear here, we render the Data provider and schema, then store the resulting HTML string into a variable. We won’t use JSX to prevent the configurations to support that in the Nest.js setup. But you could if there is a need. Once the HTML is ready, you just have to add a proper HTML wrapper, I mean head, body, styles, if necessary.

Also, you can get the support of the images, if you convert them into a data URL.

The whole HTML is ready. For safety, you can also run it through the decoder like ent. Now, we can push it into the puppeteer to get the PDF. There is also the ability to add the HTML template for the header and footer, but it only supports the inline CSS styles there.

To open the page with the UI for PDF, we should turn it into the data URL with base64 encoding.

The result is a PDF buffer which we can then simply feed into the response stream. You can, of course, feed it as a plain HTML string, but with the support of the stream, we can reduce memory load slightly. With the file stream, we should specify the content type, content length, and content disposition with the final file name.

Step five: connect UI and Shared component

Since both UI and Shared components are ready for use, now, we should start connecting them. Gladly, with Lerna, we can import components easily between the projects, the same way as with the PDF service.

We can now replace the PDF preview container content with the real preview. So, as it was mentioned before — data is separate and the template is separate.

We made a lot of effort to make the preview easy to use. Also, for simplicity, we reuse the default template with titles included. The results are noticeable! So now, we can recognize every change from the form reflected on the preview. You can also create separate pages emulation if necessary, of course. But it is not in our example case.

Step six: connect UI and Service

With all we have at this point, not much is left to do. Let’s connect the UI with the service and see the final results as PDF!

For this simple case, it is enough just to use ^fetch method to call the service. As a result, we will open the generated PDF document in the new window. But, you, of course, can save the resulting Blob in any way you want.

Now you have it! PDF file using the React components!

‍

Conclusion:

You went through the full cycle of the PDF generation, from the basic steps to the final result.

Sure thing, there are many other possible ways of generating the PDFs, but the way, you went through, is a good fit for the React-based UI.

We were able to reuse the components on both UI and PDF Service, which gave us an immediate preview of the final file. We also created a predictable structure of blocks (Schema), which guarantees the consistency of the resulting reports. By leveraging headless chrome with puppeteer we were able to produce the PDF file in the end. Both sides were glued using the REST API with the help of the Nest.js framework, and TypeScript.

With this approach, you can create any type of block, either dynamic or static, complex or simple, reusable or not. The main thought was to show one of the possible ways to solve the problem.

We haven’t really covered the page breaks issue, but with this approach, you can use the ^page-break-* CSS properties, and it’s up to the requirements of how to place them.

All the code is available on GitHub. Feel free to reach me with suggestions, proposals, or even critiques :)

PS: The repository also contains the Docker setup for the service, code has additional things included (like API swagger and etc), examples in the topic are very simplified.