Technical SEO & JavaScript

Bookmark

The web is an ever-changing platform and so are the websites we create. With new technologies and possibilities come new challenges for both developers and SEOs. In this session, we will talk about how developers and SEOs can become a team and solve challenges together. We will look into a few scenarios where this collaboration can bring success to a project.



Transcription


Hello and welcome everybody out there. I'm super excited to be talking about technical seo to all of you today. Specifically, I want to focus a little bit on vue.js, but basically none of the things that I talk about are very, very vue-specific. If you want to learn more about vue-specific seo, then I'll show you a few resources after the talk. So when we talk about seo, then a lot of people are like, what does it even mean? Besides the obvious meaning, which is search engine optimization, it seems that it's very non-obvious to a lot of people what this is, and a bunch of people think it's basically snake oil and trying to sell links and stuff like that. It's not that. Fundamentally, what seo is about is making sure that what you are creating, what you are putting online there, is actually visible to people looking for it. Because if you are going online to seek something that you don't know yet, then you're going to your search engine of choice, and you search for whatever it is that you need, and then you find potential web applications or websites that are serving that specific purpose. To do so, you need to make sure that search engines can actually find your things, and thus people using search engines can actually find the things that you build. And that entails making sure that the robots, which are the search engines, because they are computer programs that consume your content, you need to make sure that these robots actually can process and understand what you're putting out there on the web. And for that, there is a bunch of stuff that needs to be done. And a bunch of it is technical, but not all of it is technical. So seo is a very broad field, similar to how software engineering is a very broad field. You might do front end, you might do back end, you might do embedded systems, you might do game development. seo is not only technical, but we'll focus on the technical side today, as we are all developers. So when it comes to search engines, search engines fundamentally run a process. And this process can be broken down into multiple steps. The very first step is actually making HTTP requests to your server, to a URL that we found somewhere, and then fetching whatever comes back from your server. And we need to understand what it is. So is this a website about dogs? Is this a website about cats? Is this a website about boats? And then we would put that into a database, which is called the index. And whenever someone looks for, let's say, you make a cat website, if someone looks for cat pictures, then we have lots of images, lots of websites for cat pictures in our index. And we need to fetch them, and we need to figure out what is the best result for this user at this time. And that's the ranking process, where we figure out where each of the websites in our index goes on the list of results. And then we show these results to users. Now, as developers, we actually have fundamental impact on this process.

Because if you think about it, both the server side, as well as the client side, influence if a website or a web application can be crawled and indexed. If we can't make requests to certain pieces of content, or if the URL just gives us a 404 or a 500 error, then we can't really take it in and process it further. If we do, but then there's something in the javascript that prevents us from seeing any of the content, then we don't really know what the website is about. And we can't actually put that into the index either. And if it's not in the index, we can't really rank it. A lot of people are focusing on ranking when it comes to seo. And I don't think that's reasonable, because it's just a lot of factors. And ranking is also influenced by where is our user, how many other competitors are there, what are the competitors doing. So there's a constant flux in rankings. And I don't think it makes sense to look into those too much. I would focus more on the crawling and indexing side of things, because that's where you can have a lot of impact. Now, as I said, the process starts with crawling, which means we discover a URL somewhere. And then we make a request to that URL. URLs take many forms, but these are typical URLs that you probably have encountered yourself or have provided yourself. So you have a host, you have a path, and sometimes you have subsections. The thing with these hashes, or fragments as they are called technically, is that they are not meant to address different content. So page should have a lot of content, and then part of it is the content that is addressed by subsection. But it's not that if I have page and then I change the fragment that I get completely new content that I wouldn't have seen beforehand if I would have gone to slash page. Unfortunately, that's what a lot of people are doing. They are using hash-based routing, which is legitimate because it is a lot easier to set up for a local development server. But nonetheless, for seo purposes, this is not great. Because a crawler assumes that these two pages are the same as the home page here. The hash about should just be a part of what is already on the home page on slash. Same with an exclamation mark in there. We had a specific URL crawling scheme for such URLs, but we have deprecated it because we found it not to work that well.

So definitely make sure to use history api rather than hash-based routing. In 2019, 12% of the websites we crawled for the web analog unfortunately continued to use fragment URLs.

Another thing is that people are worried, is client-side rendering OK? And to which I say, yes, it is, because we have a render queue and we render websites. So client-side rendering works. Also, people are like, oh, but it takes a long time for Google to actually client-side render. That's not true. The queue time is really, really short. It has been at least since 2020. Also, we're using an evergreen Chrome. So if you're using ES6 or something, that is not a problem because we are updating our renderer to be the latest version of Chromium anyways.

Another thing that happens often in client-side rendered applications are soft errors. They happen because you go to a URL that doesn't exist, and you see an error message, and you are happy. But if you check, you actually see that the HTTP status that comes back is a successful request. And then there's an api request made. And then only if the api request returns an error, we are actually showing an error page. Unfortunately, that's not very great. It would be better to redirect to a URL that actually gives us a server-side error. That would prevent such soft 404 errors that might show up nastily in search results.

So with that in mind, you've seen a few things that you can already look into. And the two biggest issues are not using proper URLs for your routes and also making sure that you are using HTTP services correctly or you mitigate client-side 404 errors, such as shown in here. There's a lot more to consider. If you're interested, check out these resources. They have a lot more content and a lot more time to present that to you. And with that, I'd like to say thank you very, very much and enjoy the rest of the Lightning Talks.

♪♪♪

8 min
21 Oct, 2021

Check out more articles and videos

We constantly think of articles and videos that might spark Git people interest / skill us up or help building a stellar career

Workshops on related topic