Majestic

  • Site Explorer
    • Majestic
    • Récapitulatif
    • Domaines référents
    • Backlinks
    • * Nouveaux
    • * Disparus
    • Contexte
    • Intitulé de lien
    • Pages
    • Sujets
    • Link Graph
    • Sites liés
    • Outils Avancés
    • Author ExplorerBeta
    • Summary
    • Similar Profiles
    • Profile Backlinks
    • Attributions
  • Comparer
    • Récapitulatif
    • Backlink History
    • Flow Metrics History
    • Sujets
    • Clique Hunter
  • Outils pour les liens
    • Mon Majestic :
    • Activité récente
    • Rapports
    • Campagnes
    • Domaines vérifiés
    • OpenApps
    • Clés API
    • Mots-clés
    • N-grams Near Links
    • Keyword Checker
    • Search Explorer
    • Outils pour les liens
    • Bulk Backlinks
    • Neighbourhood Checker
    • Envoyer des URL
    • Expérimental
    • Regroupement d’index
    • Link Profile Fight
    • Liens mutuels
    • Solo Links
    • Rapport PDF
    • Typo Domain
    • TLD Checker Nouveaux
  • Free SEO Tools
    • Démarrer
    • Backlink Checker
    • Majestic Million
    • Modules d'extension Majestic
    • Google Sheets
    • Post Popularity
    • Social Explorer
  • Assistance
    • Blog Lien externe
    • Assistance
    • Démarrer
    • Outils
    • Subscriptions & Billing
    • Questions fréquentes
    • Glossaire
    • Guide de style
    • Vidéos didactiques
    • API Reference Guide Lien externe
    • Nous contacter
    • About Backlinks and SEO
    • SEO in 2026
    • The Majestic SEO Podcast
    • All Podcasts
    • What is Trust Flow?
    • Link Building Guides
  • Compte gratuit
  • Tarifs
  • Login
  • Language flag icon
    • English
    • Deutsch
    • Español
    • Français
    • Italiano
    • 日本語
    • Nederlands
    • Polski
    • Português
    • 中文
  • Démarrer
  • Login
  • Tarifs
  • Compte gratuit
    • Récapitulatif
    • Domaines référents
    • Carte
    • Backlinks
    • Nouveaux
    • Disparus
    • Contexte
    • Intitulé de lien
    • Pages
    • Sujets
    • Link Graph
    • Sites liés
    • Outils Avancés
    • Récapitulatif
      Pro
    • Backlink History
      Pro
    • Flow Metrics History
      Pro
    • Sujets
      Pro
    • Clique Hunter
      Pro
  • Bulk Backlinks
    • N-grams Near Links
    • Keyword Checker
    • Search Explorer
      Pro
  • Neighbourhood Checker
    Pro
    • Regroupement d’index
      Pro
    • Link Profile Fight
      Pro
    • Liens mutuels
      Pro
    • Solo Links
      Pro
    • Rapport PDF
      Pro
    • Typo Domain
      Pro
    • TLD Checker Nouveaux
      Pro
  • Envoyer des URL
    • Summary
      Pro
    • Similar Profiles
      Pro
    • Profile Backlinks
      Pro
    • Attributions
      Pro
  • Rapports personnalisés
    Pro
    • Démarrer
    • Backlink Checker
    • Majestic Million
    • Modules d'extension Majestic
    • Google Sheets
    • Post Popularity
    • Social Explorer
    • Démarrer
    • Outils
    • Subscriptions & Billing
    • Questions fréquentes
    • Glossaire
    • Vidéos didactiques
    • API Reference Guide Lien externe
    • Nous contacter
    • Messages
    • The Company
    • Guide de style
    • Conditions générales
    • Privacy Policy
    • RGPD
    • Nous contacter
    • SEO in 2026
    • The Majestic SEO Podcast
    • All Podcasts
    • What is Trust Flow?
    • Link Building Guides
  • Blog Lien externe
    • English
    • Deutsch
    • Español
    • Français
    • Italiano
    • 日本語
    • Nederlands
    • Polski
    • Português
    • 中文

Show AI crawlers what you want them to see

Arnout Hellemans

Arnout Hellemans advises that you don’t have to provide AI crawlers with full access to all of your content.

@hellemans    
Arnout Hellemans 2026 podcast cover with logo
More SEO in 2026 YouTube Podcast Playlist Link Spotify Podcast Playlist Link Audible Podcast Playlist Link Apple Podcast Playlist Link

Show AI crawlers what you want them to see

Arnout says: “People really need to look at their technical setup.

With that, I mean the rendered version versus the raw HTML – especially with a lot of AI crawlers not rendering yet, and also because I've seen a lot of discrepancies in that area.”

Is the rendered version typically quite different for different search engines?

“As with a lot of things in SEO, it depends.

Sometimes, when websites are built using JavaScript frameworks, the content is actually different in the raw HTML versus the rendered HTML.

Titles might be different. There might not be schema markup, there might be different headings, etc., because those can be changed by the execution of JavaScript.

That can severely impact the discoverability of your page.”

What does this mean for the use of JavaScript over the next few years?

“Crawling a website using a rendered version takes a lot of energy, because you actually need to render the pages. Most crawlers would rather just scrape the page, get the raw HTML, and get all the elements.

Currently, for a lot of AI crawlers – OpenAI, Perplexity, etc – it's just too expensive for them to do it. Microsoft and Google are doing it. The thing is, you can properly implement this with pre-rendering solutions and hybrid solutions. There are loads of ways to work around it, so it’s not the death of JavaScript, but I feel that JavaScript has had a lot of technical difficulties that feel scary for most people in SEO.

Most people don't even know how to do this. It took me a while to figure it out. It's definitely not the death of JavaScript, because JavaScript has given us a lot of interactive elements and all kinds of things. However, it's something people should be aware of, especially with all these new crawlers popping up.

You want your content to be seen by AI crawlers as well as search engines.”

What does an SEO need to do in order to determine how their website is seen by AI crawlers?

“For one, there's a free extension in Chrome called View Rendered Source, which will show you the difference between the raw HTML and the rendered source view.

The other thing is, if you use a crawler, look at the difference between the raw HTML and the rendered HTML. I really enjoy using Sitebulb for that because it will literally show you which links were added using JavaScript, what piece of content was changed because of JavaScript, which images are being rendered through the execution of JavaScript, etc.”

How do you determine how AI search engines see things differently, and what do you do to ensure that they have a better idea of what's on your web pages?

“Basically, they only look at the raw HTML. What you see in the raw HTML is what they can see.

If your images aren't visible in there, then they won't see the images. If your page title isn't filled in or there are no headings in the raw HTML, then that's what they are seeing.

In an ideal situation, the raw HTML and the rendered HTML are basically the same. Then, your web page will be a lot faster because no JavaScript execution is needed to render the page. Is that always possible? No, but you should get as close as possible because it makes everything faster, and there won't be any indexation problems. It's way easier.

Back in the day, we would just look at the raw HTML, and that was it. Nowadays it's a lot more difficult.”

As an SEO, do you need to prioritise certain elements that should be incorporated, or do you prioritise certain pages?

“The basics are the most important. It's making sure that the core content of the page can be read. This means the headings, page title, images, etc. – because that's what you want in the index. If certain elements are not working, or a footer is not working, that's not the biggest issue.

You need to prioritise having all the elements in the raw HTML, and they should not change.

What I have seen a lot is that, when the page gets rendered, the elements are still the same, but they've been taken away and inserted again, which makes search engines think, ‘What's happening here? I thought I had the H1, and now the H1 is somewhere else.’ It's still the same H1, but it is confusing the crawler.”

If you have a JavaScript menu, do you need to prioritise the inclusion of HTML links to all the other pages in your site, or is an XML sitemap sufficient?

“An XML sitemap is always a good idea. The challenge with links in these mega menus is when you switch off JavaScript.

Try doing it. Switch off JavaScript in your browser (using a NoScript plugin or whatever) and then try browsing your own website. It might be very hard.

In an ideal situation, you want the internal linking to keep working, but the biggest priority is getting the content and the right markup – the heading, the page title, etc. – in your content in the index. That's the most important part.”

Does this impact brand visibility?

“Yes, but most search engines will first crawl and index the raw HTML, then render the page, and then compare it to the original. Then it will think, ‘Should I overwrite this?’

Say you used JavaScript to insert structured data. That might be an issue because there might be a delay in the structured data being rendered. For instance, a product might have been out of stock, but now it's in stock on your website. However, because the page hasn't been rendered, Google still think it's out of stock.

That's why those particular elements are really important to have in your raw HTML, not only in your rendered.”

How do you know which elements are likely to have the biggest impact on rankings?

“Again, it depends.

Say you have review stars. If you're using JavaScript to insert that part of the code, it might not appear for all the pages immediately. Then, that is an important element.

Say your headings are changing; there are no headings in the raw HTML, but there are headings in the rendered. That will impact the ranking of a new article. Eventually, it will fix itself once the rendered version gets indexed. Initially, however, you won't benefit from the work you did.”

Does this mean it is even more important that your CMS incorporates the non-rendered version as readable by modern search engines?

“Yes. I see a lot of headless CMSs popping up, like Storyblok, and they usually have a JavaScript-based app. If you don’t use a pre-rendering solution – a server version that will render the page and serve a fully rendered page to both search engines and users – it will impact everything.

Any CMS can be adjusted to be able to serve this. It's a little more work, but it can be done. Most of the traditional CMSs use less JavaScript, so the core elements will just be there.

You should just be aware, and I feel a lot of people are not aware. I see a lot of use cases for React-based applications or websites, but there are also a lot of cases where you shouldn't.

If you're making loads of changes, you don't need a developer for everything. You are better off going for a fairly standard CMS out of the box – whether it's WordPress, Drupal, or one of the builders like Wix or Duda, rather than building something with the React front end, because that will create these problems.

We should be aware that this is happening, and I see a lot of people who are a little scared of doing SEO this way.”

Do you implement changes on a test-and-learn basis to analyse the impact of what you're doing, and how do you demonstrate that impact to stakeholders?

“First, you need to understand what's happening, using tools like View Rendered Source.

I've had projects where it would also be dependent on the user agent, and what IP location you are in/what country. It's difficult.

Once you've understood what the problem is, you can build a case around it fairly easily. If the structured data for a review snippet is gone for some pages, and it's still there on other pages, you can show that the click-through rate is a lot higher for the pages with it than without it, so it's highly likely that it is a result of this issue.”

How much of an SEO’s time should be spent on this?

“It depends. If you are going to work on a React-based platform, you should spend a lot of time fixing and monitoring this. Sometimes it breaks and you get unforeseen errors, like the pre-rendering stops working. Then, you have a big problem. Most people aren't looking at that.

It depends on your platform. If you don't use any JavaScript in the front end, or hardly any, it's less of an issue.”

Arnout, what's the key takeaway from the tip you shared today?

“You should not only look at the source of a page but also look at the rendered version.

View Rendered Source (the extension by Jon Hogg) is awesome for that. SiteBulb also has a great comparison in their crawler.

More people should just check this. I see a lot of people not checking this and failing to solve some issues.”

Arnout Hellemans is a Freelance Tech SEO and Analytics Consultant. Find out more over at OnlineMarkethink.com.

@hellemans    

Also with Arnout Hellemans

Arnout Hellemans 2025 podcast cover with logo
SEO in 2025
Satisfy queries by putting the user first

With AI-produced content everywhere, you need to be putting out exactly what your users want. Arnout Hellemans from OnlineMarkethink implores you to listen more closely to find out what that is.

Arnout Hellemans 2024 podcast cover with logo
2024 Additional Insight
2024 is the year of Query Satisfaction
Arnout Hellemans want’s SEOs to prioritize user query satisfaction, with a focus on bringing together content, UX, Core Web Vitals and EEAT.

Choose Your Own Learning Style

Webinar iconVideo

If you like to get up-close with your favourite SEO experts, these one-to-one interviews might just be for you.

Watch all of our episodes, FREE, on our dedicated SEO in 2026 playlist.

youtube Playlist Icon

Podcast iconPodcast

Maybe you are more of a listener than a watcher, or prefer to learn while you commute.

SEO in 2026 is available now via all the usual podcast platforms

Spotify Apple Podcasts Audible

Book iconBook

This is our favourite. Sometimes it's better to sit and relax with a nice book.

The best of our range of interviews is available right now as a physical copy and eBook.

Amazon US Amazon UK

Don't miss out

Opt-in to receive email updates.

It's the fastest way to find out more about SEO in 2026.


Pouvons-nous améliorer cette page pour vous ? Donnez-nous votre opinion

Fresh Index

Adresses URL différentes recensées 232 577 791 400
Adresses URL différentes trouvées 792 809 740 483
Plage de dates 11 janv. 2026 – 12 mai 2026
Dernière mise à jour Il y a 29 minutes

Historic Index

Adresses URL différentes recensées 4 502 566 935 407
Adresses URL différentes trouvées 21 743 308 221 308
Plage de dates 06 juin 2006 – 26 mars 2024
Dernière mise à jour 03 mai 2024

SOCIAL

  • LinkedIn
  • YouTube
  • Facebook
  • Bluesky
  • Twitter

ENTREPRISE

  • Blog Lien externe
  • Présentation
  • Conditions générales
  • Règlement en matière de confidentialité
  • RGPD
  • Nous contacter

OUTILS

  • Tarifs
  • Site Explorer
  • Comparer les domaines
  • Bulk Backlinks
  • Search Explorer
  • Developer API Lien externe

MAJESTIC POUR

  • Trust Flow
  • Valeurs de Flow Metric
  • Link Context
  • Backlink Checker
  • Découverte d’influenceurs
  • Entreprise Lien externe

PODCASTS & PUBLICATIONS

  • The Majestic SEO Podcast
  • SEO in 2026
  • SEO in 2025
  • SEO in 2024
  • SEO en 2023
  • SEO en 2022
  • All Podcasts
top ^