Demystifying Memory Leaks in JavaScript

Rate this content
Bookmark

Today it is possible to write complex applications with only a handful of developers in a short time frame relying upon Frameworks and tooling. But what happens when the application crashes with out of memory errors? How is it possible to guarantee fast response times? These problems are still considered difficult to solve.

Let's dive into the Node.js internals and learn about profilers and the garbage collector. Understanding how the system works in depth let's you write code that is safer, faster and less error prone.

Let's make sure you always provide the best experience for everyone. Yourself and your customer. Let us find that memory leak and fix it.

Learnings

  •         Participants understand the Node.js memory handling and their shortcomings.
  •         Participants know when to profile their application to identify memory leaks and slow code.
  •         Participants are able to find and address most memory leaks.

FAQ

A memory leak occurs when a computer program incorrectly manages random access memory (RAM) such that memory which is no longer needed is not released. Over time, this can lead to a buildup of unused memory, potentially causing the program to crash due to exhaustion of available memory.

JavaScript manages memory through an automatic garbage collection process where memory is automatically allocated and freed. However, JavaScript still faces issues like memory leaks due to complexities in handling dynamic memory allocations in the heap memory.

Common causes include improper management of event listeners, closures that prevent garbage collectors from freeing up memory, and the misuse of file descriptors which may not be properly closed after use.

Memory leaks in Node.js can be detected using tools and flags such as '--inspect' for connecting to Chrome's inspector, 'trace GC' to watch garbage collector operations, and '--abort-on-uncaught-exception' for creating heap dumps when exceptions occur.

Handling large JSON objects efficiently involves ensuring they do not exceed memory type limits, using streaming techniques to process data in chunks rather than loading entire objects into memory at once, and being cautious with the allocation and lifespan of these objects in your program.

To fix a memory leak, you should monitor memory usage, profile your application using memory tools, take heap snapshots to identify leaking points, and use tools to compare memory usage over time. It's crucial to fix leaks in a controlled environment like staging before production deployment.

Ruben Bridgewater
Ruben Bridgewater
33 min
24 Jun, 2021

Comments

Sign in or register to post your comment.

Video Summary and Transcription

The Talk discusses demystifying memory leaks in JavaScript, covering topics such as memory allocation, typical memory leaks and issues, handling file descriptors and event listeners, tools and techniques for identifying memory leaks, fixing memory leaks and restarting applications, and Ruben's personal experience with memory leaks.

1. Demystifying Memory Leaks in JavaScript

Short description:

Hello, everyone. My name is Ruden Britschwurter. I'm a NodeJS TC member. I work as a principal software architect at MyConvolt, and I'm happy to be here today at the Node conference. Today I'm going to talk about demystifying memory leaks. So memory leaks are often considered something difficult and hard to solve. But is that actually so? And how can we ease the process? Memory leak is when a computer program incorrectly manages random access memory. In a way that memory, which is no longer needed, is not released. And the blue line is clearly a memory leak because over time you allocate just more and more memory without actually freeing it again. And this is bad. So how do we handle memory in JavaScript in particular? Because we do not have to worry about it, right? This is all done transparently and this is perfect. And memory is just freed. Why should there be a memory leak in the first place?

Hello, everyone. My name is Ruden Britschwurter. I'm a NodeJS TC member. I work as a principal software architect at MyConvolt, and I'm happy to be here today at the Node conference.

And today I'm going to talk about demystifying memory leaks. So memory leaks are often considered something difficult and hard to solve. But is that actually so? And how can we ease the process?

To get to that, I would first like to answer the question what a memory leak actually is. And I'm consulting Wikipedia to answer that question. So memory leak is when a computer program incorrectly manages random access memory. In a way that memory, which is no longer needed, is not released. As such, we are going to stack up more and more memory over time. And the program, in its worst case, might just crash because there is no more memory to allocate. And that's really the worst-case scenario. Or maybe you're in a cloud environment and you're going to have to pay much more money because you have an auto-scaling active and more and more memory is allocated there.

So here we have a graph which clearly shows you how memory leak would look like in comparison to the memory usage and the time the program runs. And the yellow line is a perfect program pretty much. It starts up and you allocate some memory and then there are some ups and downs and this is perfect for each program pretty much. Sometimes spikes might be higher or going a little bit down again, but on average it's the line, the flat line. And the blue line is clearly a memory leak because over time you allocate just more and more memory without actually freeing it again. And this is bad.

So how do we handle memory in JavaScript in particular? Because we do not have to worry about it, right? This is all done transparently and this is perfect. And memory is just freed. Why should there be a memory leak in the first place? So memory is divided into a stack memory and a heap memory. The heap memory is a dynamically allocated memory and the stack is done by the operating system in a typical case. Each thread has some stack memory. It is the last in-first-out algorithm, also called lethal. It's very simple, it's super fast and the memory that is allocated on the stack is automatically reclaimed as soon as the function exits. When we compare that with the heap memory, there is a lot of things going on because the stack memory normally only contains pointers to the function that is currently running. And if you push onto the stack, the currently running pointer, then we get to the heap, which is the dynamically allocated memory. And here we have a nice overview from V8 that is done for the Times of India.

2. Memory Allocation and Garbage Collection

Short description:

There is a lot of things going on. JSObjects are allocated, we have JavaScript source code, optimized code, we have regular expression code, strings, and much, much more. The heap memory, in V8 in particular, is divided into three areas. We have the young generation, intermediate generation, and old generation. The garbage collector is responsible for automatically freeing the memory we allocate, but it can sometimes go haywire.

There is a lot of things going on. JSObjects are allocated, we have JavaScript source code, optimized code, we have regular expression code, strings, and much, much more. So, this is the main memory for our program in this case.

And the heap memory, in V8 in particular, is divided into three areas. Again, we have the young generation. So, as soon as we allocate a new variable, let's say we say let the pool is the string test, then you are going to allocate the memory test and it's going to be put into the young generation. This memory is relatively small. Mostly JavaScript will have intermediate variables that you use to compute the next value immediately. And you do not need the variable as soon as you compute your next value. This is all done synchronously. So, we want to dispose all of that memory that we do not need as soon as possible. So, this young generation will only survive one so-called scavenge run. This is the first run where our program to create a memory is going to try to get rid of those not-anymore-used variables. And if any variable survives that run, it is then pushed into the intermediate generation. And if it survives the second run, then it's pushed into the old generation. This is the bigger part of the application. It's normally used for long and used variables. So, things that you reuse in each application that might hold pointers to a lot of things. And we use a different algorithm to free the memory in this case. And the algorithm used there is called MarkSweep. We start off from our root object. The root object in a browser would be the window object, and in Node.js, it's global. While in modern JavaScript, it would be in both, just global disk. And we use a somewhat like, similar to a recursive algorithm where we start off with the root object and we just connect each dot. We try to connect each node that is somewhat connected to their root object. All other nodes, all other variables, or allocations are going to be freed. And this should be mostly ideal to free all the memory that is not used.

So, I already spoke about that we as a developer do not have to worry about the memory that we allocate because it's going to be automatically freed. What is done in the background is a so-called garbage collector is run and that garbage collector has a lot to do and at some point it might actually even go haywire and not, does not work as we anticipated it would be. So we have to look into that a little bit closer.

QnA

Check out more articles and videos

We constantly think of articles and videos that might spark Git people interest / skill us up or help building a stellar career

It's a Jungle Out There: What's Really Going on Inside Your Node_Modules Folder
Node Congress 2022Node Congress 2022
26 min
It's a Jungle Out There: What's Really Going on Inside Your Node_Modules Folder
Top Content
Do you know what’s really going on in your node_modules folder? Software supply chain attacks have exploded over the past 12 months and they’re only accelerating in 2022 and beyond. We’ll dive into examples of recent supply chain attacks and what concrete steps you can take to protect your team from this emerging threat.
You can check the slides for Feross' talk here.
Towards a Standard Library for JavaScript Runtimes
Node Congress 2022Node Congress 2022
34 min
Towards a Standard Library for JavaScript Runtimes
Top Content
You can check the slides for James' talk here.
ESM Loaders: Enhancing Module Loading in Node.js
JSNation 2023JSNation 2023
22 min
ESM Loaders: Enhancing Module Loading in Node.js
Native ESM support for Node.js was a chance for the Node.js project to release official support for enhancing the module loading experience, to enable use cases such as on the fly transpilation, module stubbing, support for loading modules from HTTP, and monitoring.
While CommonJS has support for all this, it was never officially supported and was done by hacking into the Node.js runtime code. ESM has fixed all this. We will look at the architecture of ESM loading in Node.js, and discuss the loader API that supports enhancing it. We will also look into advanced features such as loader chaining and off thread execution.
Out of the Box Node.js Diagnostics
Node Congress 2022Node Congress 2022
34 min
Out of the Box Node.js Diagnostics
In the early years of Node.js, diagnostics and debugging were considerable pain points. Modern versions of Node have improved considerably in these areas. Features like async stack traces, heap snapshots, and CPU profiling no longer require third party modules or modifications to application source code. This talk explores the various diagnostic features that have recently been built into Node.
You can check the slides for Colin's talk here. 
Node.js Compatibility in Deno
Node Congress 2022Node Congress 2022
34 min
Node.js Compatibility in Deno
Can Deno run apps and libraries authored for Node.js? What are the tradeoffs? How does it work? What’s next?
Multithreaded Logging with Pino
JSNation Live 2021JSNation Live 2021
19 min
Multithreaded Logging with Pino
Top Content
Almost every developer thinks that adding one more log line would not decrease the performance of their server... until logging becomes the biggest bottleneck for their systems! We created one of the fastest JSON loggers for Node.js: pino. One of our key decisions was to remove all "transport" to another process (or infrastructure): it reduced both CPU and memory consumption, removing any bottleneck from logging. However, this created friction and lowered the developer experience of using Pino and in-process transports is the most asked feature our user.In the upcoming version 7, we will solve this problem and increase throughput at the same time: we are introducing pino.transport() to start a worker thread that you can use to transfer your logs safely to other destinations, without sacrificing neither performance nor the developer experience.

Workshops on related topic

Node.js Masterclass
Node Congress 2023Node Congress 2023
109 min
Node.js Masterclass
Top Content
Workshop
Matteo Collina
Matteo Collina
Have you ever struggled with designing and structuring your Node.js applications? Building applications that are well organised, testable and extendable is not always easy. It can often turn out to be a lot more complicated than you expect it to be. In this live event Matteo will show you how he builds Node.js applications from scratch. You’ll learn how he approaches application design, and the philosophies that he applies to create modular, maintainable and effective applications.

Level: intermediate
Build and Deploy a Backend With Fastify & Platformatic
JSNation 2023JSNation 2023
104 min
Build and Deploy a Backend With Fastify & Platformatic
WorkshopFree
Matteo Collina
Matteo Collina
Platformatic allows you to rapidly develop GraphQL and REST APIs with minimal effort. The best part is that it also allows you to unleash the full potential of Node.js and Fastify whenever you need to. You can fully customise a Platformatic application by writing your own additional features and plugins. In the workshop, we’ll cover both our Open Source modules and our Cloud offering:- Platformatic OSS (open-source software) — Tools and libraries for rapidly building robust applications with Node.js (https://oss.platformatic.dev/).- Platformatic Cloud (currently in beta) — Our hosting platform that includes features such as preview apps, built-in metrics and integration with your Git flow (https://platformatic.dev/). 
In this workshop you'll learn how to develop APIs with Fastify and deploy them to the Platformatic Cloud.
0 to Auth in an Hour Using NodeJS SDK
Node Congress 2023Node Congress 2023
63 min
0 to Auth in an Hour Using NodeJS SDK
WorkshopFree
Asaf Shen
Asaf Shen
Passwordless authentication may seem complex, but it is simple to add it to any app using the right tool.
We will enhance a full-stack JS application (Node.JS backend + React frontend) to authenticate users with OAuth (social login) and One Time Passwords (email), including:- User authentication - Managing user interactions, returning session / refresh JWTs- Session management and validation - Storing the session for subsequent client requests, validating / refreshing sessions
At the end of the workshop, we will also touch on another approach to code authentication using frontend Descope Flows (drag-and-drop workflows), while keeping only session validation in the backend. With this, we will also show how easy it is to enable biometrics and other passwordless authentication methods.
Table of contents- A quick intro to core authentication concepts- Coding- Why passwordless matters
Prerequisites- IDE for your choice- Node 18 or higher
Building a Hyper Fast Web Server with Deno
JSNation Live 2021JSNation Live 2021
156 min
Building a Hyper Fast Web Server with Deno
WorkshopFree
Matt Landers
Will Johnston
2 authors
Deno 1.9 introduced a new web server API that takes advantage of Hyper, a fast and correct HTTP implementation for Rust. Using this API instead of the std/http implementation increases performance and provides support for HTTP2. In this workshop, learn how to create a web server utilizing Hyper under the hood and boost the performance for your web apps.
GraphQL - From Zero to Hero in 3 hours
React Summit 2022React Summit 2022
164 min
GraphQL - From Zero to Hero in 3 hours
Workshop
Pawel Sawicki
Pawel Sawicki
How to build a fullstack GraphQL application (Postgres + NestJs + React) in the shortest time possible.
All beginnings are hard. Even harder than choosing the technology is often developing a suitable architecture. Especially when it comes to GraphQL.
In this workshop, you will get a variety of best practices that you would normally have to work through over a number of projects - all in just three hours.
If you've always wanted to participate in a hackathon to get something up and running in the shortest amount of time - then take an active part in this workshop, and participate in the thought processes of the trainer.
Mastering Node.js Test Runner
TestJS Summit 2023TestJS Summit 2023
78 min
Mastering Node.js Test Runner
Workshop
Marco Ippolito
Marco Ippolito
Node.js test runner is modern, fast, and doesn't require additional libraries, but understanding and using it well can be tricky. You will learn how to use Node.js test runner to its full potential. We'll show you how it compares to other tools, how to set it up, and how to run your tests effectively. During the workshop, we'll do exercises to help you get comfortable with filtering, using native assertions, running tests in parallel, using CLI, and more. We'll also talk about working with TypeScript, making custom reports, and code coverage.