Evolving the Node HTTP Client with undici

Rate this content
Bookmark

What are some of the problems of the current Node core HTTP client and how can we move beyond them? A talk about how undici tries to bring the next step in client side HTTP in Node. Includes some practical advice to using undici and common pitfalls with some commentary in regards to API design decisions and future plans with some extra Fetch API bonus.a

Robert Nagy
Robert Nagy
35 min
18 Feb, 2022

Comments

Sign in or register to post your comment.

Video Summary and Transcription

The Talk discusses the current state of the Node HTTP client and the problems it faces, including the lack of support for HTTP pipelining and the intrinsic link between request and response objects. The speaker introduces the library Indichy, which aims to provide a more user-friendly API for HTTP in Node. The Talk highlights the performance advantages of using WebAssembly in the Umidigi HTTP client and the plans to include it in Node Core. The speaker also mentions the support for signals and the ability to post requests in Umidigi. Additionally, the Talk covers the customization options in Undici, the different types of dispatchers available, and the potential inclusion of Indichy in Node Core. Future plans include support for HTTP 2 and 3, DNS lookup enhancements, and improvements to fetch and pool scheduling. The Talk concludes by discussing the differences in TCP implementations across operating systems and the considerations for adding Web APIs and standards to Node Core.

1. Introduction to Node HTTP Client and Indichy

Short description:

Hello, everyone. My name is Robert Nagy. I'm the lead developer at Next Edition and a frequent contributor to Node.js and also a TSC member. I'm going to talk about the current state of the Node HTTP client and the problems we see. The response object is not actually a Node stream, it lacks support for HTTP pipelining, and the request and response objects are intrinsically linked. We have tried fixing these issues multiple times, but it's difficult without causing disturbance. A few years ago, Matteo Macalina created the library Indichy, which aims to provide a more user-friendly API for HTTP in Node.

Hello, everyone. My name is Robert Nagy. I'm the lead developer at Next Edition and a frequent contributor to Node.js and also a TSC member. And I'm going to talk a little bit about some work we've been doing on the HTTP client in Node using a library we created called Bitchy.

So, first of all, let's talk a little bit about the current state of the Node HTTP client. I think most people are familiar with the HTTP request API. And people that are using NPM libraries like Got and Axios, etc., these are usually just wrappers around the HTTP request call that Node Core provides. Now, this has a long history and it has worked for a very long time. But those of us that have been working on maintaining this API feel we have kind of reached the end of the line of what we can do with reasonable effort with this API.

So what are some of the problems we see here? So first of all, the response object you receive from the request API is not actually a Node stream. It just pretends to be a Node stream. There are various reasons, both the compatibility and performance reasons for that. But there are slight differences that can cause weird issues if you're unlucky. Especially if you're using APIs that expect streams, and you wanted to use it seamlessly. It doesn't have support for HTTP pipelining. Pipelining is a feature of the HTTP protocol that can provide some significant performance advantages. Also, the request and the response object in this API are intrinsically linked. So if you destroy, for example, the request, even though it has completed, it will also kill the response. And this linkage is very difficult to fix or remove without breaking the entire ecosystem. It's entirely based on streams, which causes some limitation of what we can achieve in terms of performance. And a lot of the internals in the API are publicly accessible, and we have ecosystem modules that depend on internal implementation details. Back then, we didn't have symbols so it wasn't possible to actually hide things from users. And all of these things cause issues that we believe are very difficult to fix without causing undue disturbance in the entire ecosystem.

So a few years ago, Matteo or actually, another point here is we have tried fixing these issues multiple times, and that's caused problems and we had to revert them. I'm actually one of the persons that had spent a lot of time trying to resolve these issues. And it's quite disappointing when you are forced to revert them due to compatibility or performance regressions, et cetera. So here are some pull requests that just revert work that has been done towards this. So a few years ago, Matteo Macalina created this library called Indichy, which is a new take on what HTTP can and could, can be in Node. And I got involved a year or so ago, and I've done a lot of work making this production ready. So what are our aims? What is we are achieving or trying to achieve within Indichy? So we want a bit more user-friendly API so people don't have to go to a third-party library by default.

2. Node HTTP Client and Umidigi

Short description:

We have managed to replace the entire HTTP client outside of Node core by using WebAssembly, which provides performance advantages. Unleashy supports Keepalive and pipelining, addressing issues in Node Core. We have sorted out the problem with Keepalive in Umidigi, and achieved almost 10 times better performance compared to the Node Core client. We are developing Umidigi outside of core for later inclusion and have hidden all the internals behind symbols. We're also looking into implementing fetch on top of Umidigi. It's important to consume or destroy the body to release the connection, and we provide a helper function called dump. We have support for signals.

We don't want to have any native dependencies. So this is actually required for us to, in order to replace the entire HTTP client outside of Node core, we native dependencies. And the HTTP parsing node is HTTP, which is a native library. But we have managed to get around this by using WebAssembly, and it's working very well, and we do see actually some performance advantages with using WebAssembly.

Another important thing is we want to support Keepalive and pipelining, so I'm just quickly going to explain that. Keepalive is actually something that the node core client supports, but it's not enabled by default. And there are some things you have to think about this in Node Core that Unleashy tries to handle for you. So without pipelining, every request you do actually creates a new connection and closes that. With Keepalive, you can reuse the connection for subsequent requests. So you skip the overhead of closing and establishing a new connection. And with pipelining, it's actually possible to send multiple requests before the server has replied. And thereby, you can reduce some of the latency of your requests. And we have spent a lot of time making sure Unleashy supports this natively.

And as I mentioned, there is a problem with Keepalive that Nord Core doesn't handle, is that once a connection is established, there's a timeout that the server will keep that connection alive, waiting for further requests before closing the connection. Now, if you're unlucky, you could actually send a request at the exact moment the server's timeout happens and therefore the server will close the connection while your request is arriving. Now, some servers provide a Keepalive hint in their responses so you can actually find out for how long the server expects to keep the connection open, and therefore, the client can actually do stuff to avoid this unexpected closure happening. So that's something we've sorted out in Umidigi.

We have also looked at performance so with Umidigi we were able to achieve an almost 10 times better performance relative to the Node Core client, which I have to say is a rather good result. We are developing it outside of core at the moment for later inclusion. There are some advantages of this especially in terms of the implementation velocity development velocity and we've hidden all the internals behind symbols so we don't have a repeat of third-party libraries, depending on implementation details. And we're also looking into implementing fetch on top of Umidigi. So Umidigi's API is the most basic API is Umidigi request and you basically do an await Umidigi request where you get a body, a status code, and headers and the body is a node stream but we have also implemented some helper functions inspired by the fetch mixing body specifications so you have body.text.json.array buffer and you can just await those but otherwise the body is a normal node.js stream. So this is quite simple. One important note that I've noticed some people miss is that it's very important, even if you don't care about the body, you should consume it or destroy it to release the connection because the way it works with the keep-alive is unless the previous request has completed you can't actually process the next one so it's important to either destroy the body or dump it or consume it. We provide a helper called dump, which the downside of destroying the body is that that actually will cause the socket to be destroyed. We have a helper function here called dump which will actually try to read the entire body so that the connection can be reused but if the body or the response from the server exceeds a certain threshold then it chooses to eventually close the connection. So you don't have to download a gigabyte of the data before being able to reuse the connection. And yes, if you don't do this then the connection won't be freed for the next request. We have support for signals like any good citizen of the promise world.

QnA

Check out more articles and videos

We constantly think of articles and videos that might spark Git people interest / skill us up or help building a stellar career

It's a Jungle Out There: What's Really Going on Inside Your Node_Modules Folder
Node Congress 2022Node Congress 2022
26 min
It's a Jungle Out There: What's Really Going on Inside Your Node_Modules Folder
Top Content
Do you know what’s really going on in your node_modules folder? Software supply chain attacks have exploded over the past 12 months and they’re only accelerating in 2022 and beyond. We’ll dive into examples of recent supply chain attacks and what concrete steps you can take to protect your team from this emerging threat.
You can check the slides for Feross' talk here.
Towards a Standard Library for JavaScript Runtimes
Node Congress 2022Node Congress 2022
34 min
Towards a Standard Library for JavaScript Runtimes
Top Content
You can check the slides for James' talk here.
ESM Loaders: Enhancing Module Loading in Node.js
JSNation 2023JSNation 2023
22 min
ESM Loaders: Enhancing Module Loading in Node.js
Native ESM support for Node.js was a chance for the Node.js project to release official support for enhancing the module loading experience, to enable use cases such as on the fly transpilation, module stubbing, support for loading modules from HTTP, and monitoring.
While CommonJS has support for all this, it was never officially supported and was done by hacking into the Node.js runtime code. ESM has fixed all this. We will look at the architecture of ESM loading in Node.js, and discuss the loader API that supports enhancing it. We will also look into advanced features such as loader chaining and off thread execution.
Out of the Box Node.js Diagnostics
Node Congress 2022Node Congress 2022
34 min
Out of the Box Node.js Diagnostics
In the early years of Node.js, diagnostics and debugging were considerable pain points. Modern versions of Node have improved considerably in these areas. Features like async stack traces, heap snapshots, and CPU profiling no longer require third party modules or modifications to application source code. This talk explores the various diagnostic features that have recently been built into Node.
You can check the slides for Colin's talk here. 
Node.js Compatibility in Deno
Node Congress 2022Node Congress 2022
34 min
Node.js Compatibility in Deno
Can Deno run apps and libraries authored for Node.js? What are the tradeoffs? How does it work? What’s next?
Multithreaded Logging with Pino
JSNation Live 2021JSNation Live 2021
19 min
Multithreaded Logging with Pino
Top Content
Almost every developer thinks that adding one more log line would not decrease the performance of their server... until logging becomes the biggest bottleneck for their systems! We created one of the fastest JSON loggers for Node.js: pino. One of our key decisions was to remove all "transport" to another process (or infrastructure): it reduced both CPU and memory consumption, removing any bottleneck from logging. However, this created friction and lowered the developer experience of using Pino and in-process transports is the most asked feature our user.In the upcoming version 7, we will solve this problem and increase throughput at the same time: we are introducing pino.transport() to start a worker thread that you can use to transfer your logs safely to other destinations, without sacrificing neither performance nor the developer experience.

Workshops on related topic

Node.js Masterclass
Node Congress 2023Node Congress 2023
109 min
Node.js Masterclass
Top Content
Workshop
Matteo Collina
Matteo Collina
Have you ever struggled with designing and structuring your Node.js applications? Building applications that are well organised, testable and extendable is not always easy. It can often turn out to be a lot more complicated than you expect it to be. In this live event Matteo will show you how he builds Node.js applications from scratch. You’ll learn how he approaches application design, and the philosophies that he applies to create modular, maintainable and effective applications.

Level: intermediate
Build and Deploy a Backend With Fastify & Platformatic
JSNation 2023JSNation 2023
104 min
Build and Deploy a Backend With Fastify & Platformatic
WorkshopFree
Matteo Collina
Matteo Collina
Platformatic allows you to rapidly develop GraphQL and REST APIs with minimal effort. The best part is that it also allows you to unleash the full potential of Node.js and Fastify whenever you need to. You can fully customise a Platformatic application by writing your own additional features and plugins. In the workshop, we’ll cover both our Open Source modules and our Cloud offering:- Platformatic OSS (open-source software) — Tools and libraries for rapidly building robust applications with Node.js (https://oss.platformatic.dev/).- Platformatic Cloud (currently in beta) — Our hosting platform that includes features such as preview apps, built-in metrics and integration with your Git flow (https://platformatic.dev/). 
In this workshop you'll learn how to develop APIs with Fastify and deploy them to the Platformatic Cloud.
0 to Auth in an Hour Using NodeJS SDK
Node Congress 2023Node Congress 2023
63 min
0 to Auth in an Hour Using NodeJS SDK
WorkshopFree
Asaf Shen
Asaf Shen
Passwordless authentication may seem complex, but it is simple to add it to any app using the right tool.
We will enhance a full-stack JS application (Node.JS backend + React frontend) to authenticate users with OAuth (social login) and One Time Passwords (email), including:- User authentication - Managing user interactions, returning session / refresh JWTs- Session management and validation - Storing the session for subsequent client requests, validating / refreshing sessions
At the end of the workshop, we will also touch on another approach to code authentication using frontend Descope Flows (drag-and-drop workflows), while keeping only session validation in the backend. With this, we will also show how easy it is to enable biometrics and other passwordless authentication methods.
Table of contents- A quick intro to core authentication concepts- Coding- Why passwordless matters
Prerequisites- IDE for your choice- Node 18 or higher
Building a Hyper Fast Web Server with Deno
JSNation Live 2021JSNation Live 2021
156 min
Building a Hyper Fast Web Server with Deno
WorkshopFree
Matt Landers
Will Johnston
2 authors
Deno 1.9 introduced a new web server API that takes advantage of Hyper, a fast and correct HTTP implementation for Rust. Using this API instead of the std/http implementation increases performance and provides support for HTTP2. In this workshop, learn how to create a web server utilizing Hyper under the hood and boost the performance for your web apps.
GraphQL - From Zero to Hero in 3 hours
React Summit 2022React Summit 2022
164 min
GraphQL - From Zero to Hero in 3 hours
Workshop
Pawel Sawicki
Pawel Sawicki
How to build a fullstack GraphQL application (Postgres + NestJs + React) in the shortest time possible.
All beginnings are hard. Even harder than choosing the technology is often developing a suitable architecture. Especially when it comes to GraphQL.
In this workshop, you will get a variety of best practices that you would normally have to work through over a number of projects - all in just three hours.
If you've always wanted to participate in a hackathon to get something up and running in the shortest amount of time - then take an active part in this workshop, and participate in the thought processes of the trainer.
Mastering Node.js Test Runner
TestJS Summit 2023TestJS Summit 2023
78 min
Mastering Node.js Test Runner
Workshop
Marco Ippolito
Marco Ippolito
Node.js test runner is modern, fast, and doesn't require additional libraries, but understanding and using it well can be tricky. You will learn how to use Node.js test runner to its full potential. We'll show you how it compares to other tools, how to set it up, and how to run your tests effectively. During the workshop, we'll do exercises to help you get comfortable with filtering, using native assertions, running tests in parallel, using CLI, and more. We'll also talk about working with TypeScript, making custom reports, and code coverage.